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Preface 


Memory can be typically defined as the brain function enabling the encoding, storage 
and retrieval of sensory information. In operational terms, this definition implies 
that our central nervous system not only processes various sensory modalities, be they 
visual, tactile, auditory, olfactory or gustatory, but is also capable of forming, organizing 
and conserving memory traces for extended periods of time. At both psychological 
and physiological levels, there is now a consensus that memory must no longer be seen 
as a unitary phenomenon but rather as an ensemble of dynamic processes, each one 
being subserved by different brain regions organized into multiple memory systems 
that support different forms of memory and that constantly interact to ensure optimal 
performance during any given cognitive challenge. 

Despite remarkable progress achieved over the last 30 years in our understanding 
of the neural bases of cognitive processes and associated pathologies, the questions of 
how, where and when memory traces are formed in the brain remain central issues and 
continue to fuel much debate in the field of cognitive neuroscience. How do neuronal 
systems encode, consolidate and retrieve memory? How are memories embedded into 
complex neuronal networks? How do molecular mechanisms modulate the neuronal 
plasticity and functioning of these networks during memory processing? What are the 
fundamental units of computation in the brain? Thanks to the development of novel 
approaches, including transgenic techniques, functional brain imaging, multiple cell 
recording, functional genomics and proteomics, the last decade has witnessed dramatic 
advances in the neurobiology, neuroanatomy, neurophysiology and neuropathology of 
learning and memory processes. The main aim of the Colloque Médecine et Recherche, 
titled “Memories: Molecules and Circuits,” organized by the Fondation Ipsen in Paris 
on April 24, 2006, and upon which the present set of chapters is based, was to survey 
these recent advances and to provide an integrative view of molecular, cellular, and 
systems level mechanisms underlying cognitive processes in animals, nonhuman pri- 
mates and humans. During the conference, current state of the art and future avenues 
were discussed by distinguished speakers who provided not only an overview of the 
underlying neurobiology of cognitive processes from a basic science standpoint but 
also focused on clinical and therapeutic aspects surrounding impairments associated 
with disorders that affect cognition. Views from these different speakers are reflected 
here in the form of eleven chapters that tackle key issues in the field of cognitive 
neuroscience. 

Studies of brain-damaged patients have provided remarkable insights into the neu- 
roanatomy of memory and into how memories are organized and stored in the brain. 
One chapter illustrates this line of studies and focuses on the role of the medial tempo- 
ral lobe, including the hippocampus, with respect to different memory forms such as 
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declarative memory (i.e., conscious memory for facts and events) and non declarative 
memory (i.e., unconscious memory for skills and habits that is independent of the 
medial temporal lobe). Interestingly, the data reviewed point to a time-limited role for 
the hippocampal region in remote memory storage. While the medial temporal lobe is 
crucial for acquiring new memories, it does not appear to be required to recollect de- 
tailed autobiographical memories acquired long ago, this function being progressively 
subsumed by extrahippocampal structures, namely widespread cortical regions, as 
memories mature over time. Converging evidence has also been found in experimental 
animals, as detailed in another chapter that focuses on the dynamics of hippocampal- 
cortical interactions during the course of the memory consolidation process that are 
required for the formation of enduring memories. Functional brain imaging studies 
alongside pharmacological approaches and traditional lesion techniques have provided 
new breakthroughs in the identification of the brain regions underlying the organiza- 
tion of recent and remote memories. While the neuronal mechanisms driving memory 
consolidation continue to be hotly debated, it is now acknowledged that the formation 
of long-lasting memories is no longer to be seen as a simple passive phenomenon but 
rather as a dynamic process of continuous neuronal reorganization as new memories 
are being processed and subsequently stored. 

How can we unravel the cellular and molecular mechanisms underlying the for- 
mation of enduring memories? Performing genetic manipulations to control gene 
expression is certainly one powerful approach to dissect the plethora of signaling path- 
ways that lead to memory formation. One chapter illustrates this type of approach and 
provides convincing evidence that Drosophila constitutes a valid and integrative insect 
model to study some of the cellular mechanisms involved in normal or pathological 
human memory. As highlighted in another chapter centered on the molecular bases of 
remote memory in the mouse, experimental studies have started to successfully identify 
specific mechanisms and underlying molecules that govern structural synaptic plastic- 
ity at both the hippocampal and cortical levels, as memories progressively gain stability. 

What happens when remotely acquired memories are reactivated? Memories may 
never be stored in a definitive form but continuously reorganized and restabilized 
each time they are retrieved. In this respect, the very term “consolidation” may be 
misleading. Indeed, upon reactivation, cortical memories have been shown to become 
labile again and to require another round of protein synthesis to regain stability. Two 
chapters tackle this critical issue and provide exciting evidence that disruption of the 
so-called process of “memory reconsolidation” may be a valid therapeutic strategy 
for treating maladaptive memory disorders such as post-traumatic stress and drug 
addiction. 

How are memories encoded and stored? Such processes necessarily involve the 
constitutive units of the brain, namely neurons, which are organized into complex, in- 
terconnected cell assemblies. Plasticity phenomena in the form of changes in synaptic 
efficacy within existing synapses or synthesis of novel synapses can readily occur within 
these cell assemblies. Significant advances have been achieved in the search of the neu- 
ronal code of memories using large-scale electrophysiological techniques that allow 
for the recording of multiple neuronal ensembles (up to 260 neurons) simultaneously 
in the freely moving rodent or nonhuman primate confronted with various cognitive 
challenges. Three chapters in this book provide representative examples of this kind of 
approach which, from a conceptual point of view, highlight the high selectivity, hierar- 
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chy and cooperation between hippocampal and cortical neurons in identifying specific 
stimuli, analyzing the outcome (success or failure) of the behavioral response, and con- 
verting daily experiences into various memory forms. Highly integrative properties 
of hippocampal-cortical cell assemblies that can act as basic coding units provide a 
network mechanism for the brain to achieve large memory storage capacity and higher 
cognitive functions, such as association, abstraction or generalization, as well as the 
representation of space in terms of position, direction and velocity. 

How is our brain capable of coordinating thoughts and action in relation to internal 
goals? As reviewed in another chapter, one specific region in the brain, the prefrontal 
cortex, seems to play a privileged role in exerting this type of cognitive control. This 
region appears to subserve higher-order knowledge and is particularly well suited for 
acquiring abstract rules and concepts needed to guide complex, goal-directed behaviors 
of the kind observed in nonhuman primates and humans. 

The growing knowledge of the cellular and molecular mechanisms underlying 
memory function also has opened the door to new therapeutic strategies for the 
treatment of neuropsychiatric disorders. One last chapter nicely illustrates such novel 
strategies by pointing to new molecules that disrupt memory circuits in Alzheimer’s 
disease and by identifying novel targets for alleviating the cognitive deficits associated 
with this neurodegenerative disease. 

Overall, we hope that the different chapters in this book will offer a comprehensive 
and up-to-date overview of the current knowledge about the neuroanatomy of mem- 
ory and its underlying cellular and molecular mechanisms. Clearly, mechanisms that 
govern memory processes are incredibly complex and a vast array of questions still 
remains to be addressed. However, the set of chapters presented here also demonstrate 
that the field of memory research is moving forward at a rapid pace. By increasingly 
relying on integrative approaches that involve cross-species analyses and a combina- 
tion of multiple levels of analyses, ranging from innovative cognitive paradigms to 
state-of-the-art cellular and molecular techniques, there is no doubt that ongoing and 
future studies will continue to be successful in shedding new light on this fascinating 
field of research. 

Finally, we would like to take this opportunity to thank Jacqueline Mervaillie, San- 
dra Marchand and the other contributing members of the Fondation Ipsen for their 
outstanding skills in the organization of this Colloque Médecine et Recherche. We are 
also grateful to Mary Lynn Gage for her excellent editorial assistance during the prepa- 
ration of this book. 


Bruno BONTEMPI 
Alcino J. SILVA 
Yves CHRISTEN 
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The Neuroanatomy and Neuropsychology 
of Declarative and Nondeclarative Memory 


Peter J. Bayley” and Larry R. Squire!?>* 


The hippocampus and anatomically related structures in the medial temporal lobe 
support the capacity for declarative memory (Squire and Zola-Morgan 1991; Eichen- 
baum and Cohen 2001). Declarative memory refers to the capacity to recollect facts 
and events and has the defining features of being consciously accessible and flexible 
in its expression. Declarative memory can be contrasted to a collection of nondeclar- 
ative forms of memory that includes skills and habits, priming, and simple forms 
of conditioning. These forms of memory are supported by brain structures outside 
the medial temporal lobe. Nondeclarative memory is expressed through performance 
rather than recollection and what is learned is nonconscious and dependent on the 
original learning conditions for full expression. 

This chapter describes recent studies that illuminate the function of the medial 
temporal lobe and the relationship between declarative and nondeclarative memory. 
Three findings will be emphasized: 1) the capacity for learning new factual informa- 
tion (semantic memory) is always dependent on the medial temporal lobe; 2) remote 
memory for autobiographical events (episodic memory) is independent of the medial 
temporal lobe; and 3) humans have a robust capacity for habit memory that can operate 
independently of declarative memory and independently of the medial temporal lobe. 


Failure to acquire new semantic knowledge in patients 
with large medial temporal lobe lesions 


Semantic memory refers to the factual information that we acquire about the world. 
It has been observed that the ability to acquire new semantic memory is impaired 
following medial temporal lobe damage. Thus, patients with damage limited to the 
hippocampal region have moderate difficulty in acquiring new semantic memory, 
whereas patients with damage extending beyond the hippocampus into adjacent medial 
temporal lobe structures have a more severe impairment. From these observations, one 
could conclude that the ability to acquire new semantic knowledge is fully dependent 
upon the medial temporal lobe and that complete lesions to this region should abolish 
new semantic learning. However, a second possibility is that some semantic learning can 
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proceed independently of the medial temporal lobe. Specifically, it has been suggested 
that new semantic information could be acquired directly by the neocortex (Tulving 
1991; McClelland et al. 1995). Although the specific circumstances under which this 
cortical learning might occur have not been defined, it is thought that cortical learning 
should be slow and that considerable repetition and an extended period of learning 
would be required. It has been difficult to decide between these two possibilities because 
most amnesic patients available for study have some remaining medial temporal lobe 
tissue that may be sufficient to support new semantic learning. 

We have had the opportunity to address this question by studying two patients 
(E.P. and G.P.), both of whom are severely amnesic as a result of nearly complete 
medial temporal lobe lesions bilaterally due to herpes simplex encephalitis (Bayley 
and Squire 2005). Both patients have extensive, virtually complete bilateral damage 
to the hippocampus, amygdala, entorhinal cortex, and perirhinal cortex, as well as 
approximately 70% of the parahippocampal cortex. The damage also extends beyond 
the medial temporal lobe in both patients to involve portions of the anterior insular 
cortex and the anterior fusiform gyrus. Both patients perform at chance on formal 
tests of new learning ability, for example, recall and recognition of word lists, stories, 
and diagrams (Stefanacci et al. 2000; Levy et al. 2004). Despite their profound memory 
impairments, both patients score in the normal range on tests of general intelligence and 
also perform normally on difficult tests of visual perceptual discrimination, including 
tests of the ability to discriminate among feature-ambiguous stimuli (Stark and Squire 
2000; Shrager et al. 2006). If the capacity for new semantic learning is fully dependent 
on the medial temporal lobe, then patients E.P. and G.P. should not have acquired new 
semantic knowledge since the onset of their amnesia in 1992 and 1987, respectively. 
Alternatively, if some learning of facts can occur directly in neocortex, then E.P. and 
G.P. should demonstrate evidence of having acquired some new knowledge. 

Five tests were given to assess what new knowledge might have been acquired 
since the onset of amnesia. In the first test, the patients and eight controls were asked 
20 questions that were constructed to be so easy that controls would achieve near- 
perfect scores. Some of the questions related to specific events (e.g., “What crime 
was O.J. Simpson charged with?”), and other questions related to broader semantic 
knowledge (e.g., “What is the Internet?” and “What is the new European currency 
called?”). Testing occurred first in a free-recall format and then in a three-alternative, 
multiple-choice format. The results are shown in Fig. 1A. As intended, the controls 
scored nearly perfectly (mean = 90 + 3% correct). In contrast, each patient answered 
only two questions correctly. The controls also performed well on the multiple-choice 
version of the test (97 + 2% correct; Fig. 1B). In contrast, the patients each scored 55% 
correct, a score not reliably above the chance score of 33% (binomial test, P = 0.18). 
Both patients were questioned about their correct answers, but neither patient was 
able to provide correct information, and it seemed possible that their correct answers 
may have been “well-educated guesses”. For example, on the recall test, both patients 
stated correctly that O.J. Simpson was charged with murder (“What crime was O.J. 
Simpson charged with?”). When E.P. was asked for more information, he stated, “No, 
Tm blank.” G.P. stated that O.J. Simpson was associated with “murder, his wife,” and 
then incorrectly added “I think he is in jail” and “I think she was shot. She was not 
strangled or stabbed.” In addition, on the multiple-choice test, E.P. and G.P. correctly 
chose Afghanistan as the country that was invaded in an effort to capture Osama 
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Fig. 1. Twenty easy facts. Patients E.P., G.P., and controls (CON) were given 20 questions about 
information that could only have been acquired after 1988. (A) Percent correct recall. (B) Percent 
correct responses on a three-alternative, forced-choice recognition test. Brackets show standard 
error of the mean, and the dashed line indicates chance performance (33%) 


Bin Laden (alternatives, Algeria and Colombia). However, neither patient was able 
to provide any information about who this person was or why he had become well 
known. 

Patients and controls were also given a news events test in which 55 questions were 
asked about notable events that had been in the news from 1990 to 2000 (e.g., “What 
is the name of the Russian leader who succeeded Boris Yeltsin?”). Testing was given 
first in a free-recall format and then in a four-alternative, multiple-choice format. The 
controls scored 51 + 6% correct on the news events recall test (Fig. 2A). In contrast, 
patients E.P. and G.P. performed poorly (1.8% and 3.6% correct, respectively). Again, 
the patients were not able to provide any correct details about their correct answers 
and, in some instances, appeared to be relying on semantic knowledge gained early in 
life. Thus, when E.P. was asked which country invaded Kuwait, he replied “Iran and 
Iraq.” When asked to choose, he said “Iraq” and expressed surprise when told that 
his answer was correct. The controls scored 85 + 3% correct on the recognition test, 
whereas both patients performed near the chance score of 25% (E.P., 32% correct; G.P., 
26% correct, P > 0.10; Fig. 2B). Again, the patients knew little, if anything, about their 
few correct answers. 

The patients and controls were also given a test of famous faces, consisting of 24 
photographs of famous people who came into the news in the 1990s. The test was given 
first in a free-recall format (“Who is this famous person?” e.g., Tiger Woods, Bob Dole). 
After the recall test, they were shown each photograph that they could not identify and 
asked either a yes-no question (e.g., “Is this person’s name Tiger Woods?” The name 
that was presented was correct half the time) or a three-alternative, multiple-choice 
question (e.g., “Which of the following is the correct name?” Bob Dole, Newt Gingrich, 
Ross Perot). Controls scored 58 + 8% correct on the famous faces recall test (Fig. 3A). 
In contrast, E.P. failed to recall a single name, and G.P. identified only one face correctly 
out of 24 (Colin Powell). On the recognition test (Fig. 3B), the controls scored 91 + 2% 
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Fig. 2. News events. Patients E.P., G.P., and controls (CON) were given 55 questions about news 
events that occurred from 1990 to 2000. (A) Percent correct recall. (B) Percent correct responses 
on a four-alternative, forced-choice recognition test. Brackets show standard error of the mean, 
and the dashed line indicates chance performance (25%) 
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Fig. 3. Famous faces. Patients E.P., G.P., and controls (CON) were shown photographs of 24 
persons who had become famous during the 1990s. (A) Percent correct recall of the name of 
the person. (B) Percent correct responses for recognition (both three-alternative, forced-choice 
recognition and yes/no recognition). Brackets show standard error of the mean, and the dashed 
line indicates chance performance (42%) 


correct. In contrast, both patients performed near the chance score of 42% (E.P. 33% 
correct, P > 0.10; G.P., 54% correct, P > 0.10). Again, the patients could provide no 
accurate information about their few correct answers. 
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Participants were next given a test of famous persons who were either living or had 
died recently. For this test, participants were read aloud the names of famous (n = 126) 
and fictitious (n = 126) people, one at a time. All the famous people had become famous 
by 1970, and about one half (56%) of these individuals had died between 1990 and 2001. 
Participants first judged whether each name was the name of a famous person. Then, 
for each person correctly judged to be famous, participants were asked whether the 
person was still living or was deceased. To eliminate the use of age as a cue, the average 
age of the famous people who were living was the same as the age that the deceased 
people would have reached had they still been living. When judging whether a name 
was famous or not famous, controls scored 96 + 1% correct, and patients E.P. and G.P. 
scored 73% and 78% correct, respectively (d’ discriminability score = 3.8 + 0.6 for 
controls; 1.2 for E.P. and 1.5 for G.P.). Figure 4 shows performance when participants 
were asked to judge whether the persons correctly judged to be famous were still living. 
Controls scored 73 + 3% correct. In contrast, E.P. and G.P. scored near chance, 53% 
and 50% correct, respectively (chance = 50%). Thus the patients had virtually no 
knowledge of which famous persons were living and which famous persons had died 
during the past 10 years. 

It could be argued that the failure of the patients to show new semantic learning is 
due to the fact that they have had limited exposure to sources of information about the 
world since they became amnesic and that the controls have had much more access to 
information during the corresponding time period. In the final test of new semantic 
learning, the patients were asked to draw a floor plan of their current residence from 
memory. Both patients had moved into their current residence (E.P. in 1993, G.P. in 
2003) after the onset of their amnesia. Thus, each patient must have had thousands of 
opportunities to learn about the layout of their house, and any knowledge of the floor 
plan would represent new learning. Figure 5 shows the drawings made by E.P. and G.P. 
E.P’s drawing correctly indicated the overall orientation of his house relative to the front 
door and showed correctly that the house was divided by a central corridor. However, 
the location of individual rooms was highly inaccurate. G.P’s drawing appeared to be 
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Fig.4. Living or nonliving famous persons. Patients 
60 E.P., G.P., and controls (CON) were first presented 
with the names of 126 people who had become fa- 
mous before 1970 and 126 not famous names and 
40 asked to judge which persons were famous. For each 
person correctly judged to be famous, they were 
then asked whether that person was still living (44% 
20 of the famous persons were still living, and 56% had 
died between 1990 and 2001). The bars show the 
percent correct score for the living/nonliving judg- 
ment. Brackets show standard error of the mean, 
CON EP GP and the dashed line indicates chance performance 
(n=8) (50%) 
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Fig. 5. Home floor plan. Patients E.P. (top) and G.P. (bottom) were asked to draw a floor plan 
of their current home from memory. Both patients had moved to their current homes after the 
onset of their amnesia. An accurate floor plan is shown alongside each patient’s drawing 


even less accurate. The overall shape of the residence was incorrect relative to the front 
door. The location of rooms was highly inaccurate. 

Together, these findings indicate that patients E.P. and G.P. have acquired little, if 
any, new semantic knowledge about the world since the onset of their amnesia. Thus, 
the ability to acquire conscious knowledge across widely varying learning conditions 
appears to be dependent on the medial temporal lobe. This finding suggests that, when 
amnesic patients with extensive medial temporal lobe damage demonstrate some new 
semantic learning after the onset of their amnesia (Kitchener et al. 1998; O’Kane et 
al. 2004; Verfaellie et al. 2000, Westmacott and Moscovitch 2001), the new learning is 
supported by medial temporal lobe structures that remain intact. 

The question arises as to why intact nondeclarative memory did not support the 
learning of some new semantic information. For example, in a previous study, E.P. was 
found to be capable of learning fact-like knowledge when given extensive training over 
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12 weeks (Bayley and Squire 2002). Similarly, other amnesic patients have demonstrated 
some capacity for new learning using a similar training schedule (Stark et al. 2005). The 
answer appears to be that nondeclarative memory can proceed under highly structured 
learning circumstances, in which stimuli are presented repeatedly in exactly the same 
way each time. However, nondeclarative memory is unsuited to support learning under 
the highly variable conditions that are encountered outside the laboratory. 


Spared memory for remote autobiographical events following 
damage to the medial temporal lobe 


The work summarized in the previous section demonstrated that patients with near 
complete damage to the medial temporal lobe are unable to acquire new semantic 
knowledge. Nevertheless, these patients have a rich store of semantic memories that 
were acquired from their early life, before they became amnesic. This section summa- 
rizes recent work that has assessed the status of remote autobiographical (episodic) 
memories of patients E.P. and G.P. and other patients with damage thought to be re- 
stricted to the hippocampal region (hippocampus proper, dentate gyrus, and subicular 
complex). 

Earlier studies suggested that remote autobiographical memories were intact after 
damage to the hippocampal region and surrounding cortex. For example, patient R.B., 
who had histologically identified lesions restricted to the CA1 field of the hippocampus 
bilaterally, was as good as controls at recalling autobiographical memories from his 
earlier life (Zola-Morgan et al. 1986). Similarly, patients L.M. and W.H. were unimpaired 
when asked to recall remote autobiographical memories (MacKinnon and Squire 1989). 
These two patients had histologically identified damage to all the CA fields of the 
hippocampus, as well as damage to the dentate gyrus, the subiculum (W.H. only), and 
some cell loss in the entorhinal cortex (Rempel-Clower et al. 1996). In these studies, 
the autobiographical memories were assessed using a 0-3 scoring system. In contrast 
to these findings, other patients have been described who appear to have difficulty 
recalling autobiographical episodes from all periods of their life, including the most 
remote time periods tested (Hirano and Noguchi 1998; Moscovitch et al. 2000; Cipolotti 
et al. 2001). Such findings raise the possibility that autobiographical memory is always 
dependent on the medial temporal lobe and that medial temporal lobe structures 
are always necessary for recollecting the richness of detail that is the hallmark of 
autobiographical memory. 

There appear to be two ways to reconcile the available data. One possibility is that 
an impairment of remote autobiographical memory was not detected in our earlier 
studies because we used only a simple 0-3 scoring system. A patient’s narrative could 
receive a full score of 3, yet still be impoverished in detail. A second possibility is that 
those patients who are reported to perform poorly on tests of remote autobiographical 
memory might have significant damage to critical brain structures outside the medial 
temporal lobe. We assessed these possibilities in two studies. In the first study (Bayley 
et al. 2003), we carried out a detailed analysis of the narrative content of remote 
autobiographical memories elicited from six well-characterized amnesic patients who 
have damage limited to the hippocampal region and two patients (E.P. and G.P.) who 
(as described earlier) have extensive damage to the medial temporal lobe. In a second 
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study (Bayley et al. 2005b), we carried out a detailed volumetric analysis of structural 
brain images from eight memory-impaired patients. Five of the patients have damage 
limited to the medial temporal lobe, and three have medial temporal lobe damage 
plus significant additional damage to neocortex. The aim of this second study was to 
identify anatomical differences between patients who could successfully recall remote 
autobiographical memories and those who could not. 

In the first study (Bayley et al. 2003), eight patients were each asked to recollect 
autobiographical memories from the first third of their lives before the onset of their 
amnesia (on average, before the age of 16 years; Fig. 6). Twenty-five age-matched 
controls were also asked to recollect memories from the corresponding portion of 
their early lives. Each participant was given 24 cue words, one at a time (e.g., river, bird, 
nail), and asked to produce a memory involving the cue word that was specific in time 
and place (Crovitz and Schiffman 1974). Participants were prompted to elicit as much 
detail as possible. The responses were tape-recorded, and those responses that involved 
a specific event were later scored according to how many details were contained in each 
narrative. More than 700 narratives were evaluated. Both the patients and controls 
were able to provide specific recollections in response to the cue words (patients, 
89.2% of the words; controls 95.3%; P > 0.10). Two categories of detail were defined: 
episodic detail and semantic detail. An episodic detail was specific to an event (e.g., 
the outcome of a particular sporting event played at school one day). In contrast, 
a semantic detail was not unique to the event being recollected but was nonetheless 
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Fig. 6. A timeline for the eight patients who were asked to provide remote autobiographical 
memories. The figure illustrates the time period from which each patient was asked to draw 
remote autobiographical memories, the patient’s age at the onset of amnesia, and age at the time 
of testing 
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Fig. 7. The number of details contained in remote autobiographical memories. Each participant 
was given 24 cue words (e.g., river, bottle, nail) one at a time, and asked to recollect a specific event 
from the first third of life that involved the word. Narratives were collected in two separate studies 
by different interviewers. Panels (A) (first study) and (B) (second study) show the mean number of 
details per narrative that described specific autobiographical events (episodic memory). Panels 
(C) (first study) and (D) (second study) show the mean number of details per narrative that 
were recounted as part of the recollections but were not unique to a specific episode (semantic 
memory). Each participant is represented by a filled circle, and patients are identified by their 
initials. H = patients with lesions thought to be limited to the hippocampal region (hippocampus 
proper, dentate gyrus, and subicular complex); MTL = post-encephalitic patients with large 
medial temporal lobe lesions; CON = controls 


relevant to the narrative (e.g., the name of the school). Figure 7 illustrates the results, 
shown as two separate studies. (The two studies were identical except that they were 
conducted by different interviewers.) The interviewer for the second study did not 
probe for details as extensively as the interviewer for the first study, which resulted in 
somewhat fewer elicited details for controls and patients alike in the second study). 
The recollections of the patients and the controls contained a similar number of details 
(+5%). Approximately two-thirds of the total details were classified as episodic details, 
and the remaining one-third were classified as semantic details. The patients and the 
controls also performed similarly in a number of other respects. They took a similar 
amount of time to begin their narratives, received a similar number of prompts before 
beginning, and took a similar amount of time to report the narratives. 

The reliability and accuracy of the methods used in the study were also checked. To 
assess reliability, the narratives from 10 of the participants (four patients, six controls) 
were scored independently by two raters. Both raters assigned the amnesic patients and 
controls a similar number of details (for both raters, < 1.0 detail/narrative separated 
the patients and the controls). An effort was also made to determine the accuracy 
of the narratives. After the initial interview (median = 14 months), the patients and 
controls were provided a few cues for each of their recollections and were then asked 
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four specific questions about each narrative that was specific to a particular time 
and place (maximum = 24). The rationale was that participants (and especially the 
patients) would have difficulty answering questions about a narrative if it had been 
manufactured at the time of the original interview. Using this method, 88.0% of the 
patient recollections and 93.4% of the control recollections were considered to be 
confirmed (P > 0.10). 

These results indicate that patients with lesions to the medial temporal lobe are 
able to recollect detailed, remote autobiographical memories as well as controls do. 
This conclusion should be qualified in two ways. First, it is not possible to conclude that 
the autobiographical memories of the patients are intact in every respect. Although 
we were unable to find a difference, either qualitatively or quantitatively, between the 
patient and control memories, it remains possible that the two groups differed in 
some way that was not detected by our analysis. However, the results clearly rule out 
the possibility that patients with medial temporal lobe lesions are grossly deficient at 
recollecting remote autobiographical events. 

Second, it is difficult to determine to what extent the recollections provided by the 
patients can be called episodic memories in the same sense that healthy individuals can 
recall episodes from their recent past. However, it is also difficult to make this deter- 
mination for recollections provided by the control group. One interesting possibility 
is that autobiographical memories become more fact-like and less episodic as they 
become more remote. For example, Cermak (1984) proposed that remote memory in 
amnesia consists mostly of frequently told stories that have become disconnected from 
their original context and are now part of semantic knowledge. If this proposal applies 
to patients with damage restricted to the medial temporal lobe, we would suggest that 
the same notion also applies to healthy individuals. 

Although the study described above found that the memories provided by the 
patients were as rich and detailed as the memories provided by the controls, others have 
suggested that autobiographical memory remains dependent on the medial temporal 
lobe for as long as the memory persists (Fujii et al. 2000; Rosenbaum et al. 2001). 
Accordingly, in a second study (Bayley et al. 2005b), we investigated the locus and extent 
of damage in the patients who can successfully recollect autobiographical memories 
and compared this damage to that in patients who cannot recollect autobiographical 
memories. In this study, a detailed volumetric analysis was performed on the magnetic 
resonance images from memory-impaired patients and controls. Five of the patients 
had taken part in the study just described (Bayley et al. 2003) and had damage limited 
to the medial temporal lobe (designated as the “MTL” group). These five patients 
could recollect remote autobiographical memories as successfully as their controls. 
Three other patients also took part. They were known to have medial temporal lobe 
damage plus significant additional damage to neocortex (the “MTL+” group). These 
three patients were strikingly impaired in the recollection of remote autobiographical 
memory. 

Autobiographical memories were elicited using the methods described above (Bay- 
ley et al. 2003), in which a list of 24 high-frequency nouns were presented one at a time 
with the instruction to recollect a unique event that involved the stimulus word and 
that was specific to time and place. Narratives were tape-recorded and later scored on 
a 0 to 3 scale. Three points were awarded for an episodic memory that was specific to 
time and place (e.g., a description of the day the participant started at a new school). 
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Fig. 8. Performance on a test of remote autobiographical memory. Participants were given 24 
cue words (e.g., river, bottle, nail) and asked to recollect a specific event that was associated with 
each word. Patients were asked to recall events from the first third of their life before the onset 
of amnesia. Controls were asked for events from corresponding time periods. Narratives were 
scored (0 to 3) for how well they described an event (0 = no response or a generic response, 
1 = vague reference to a memory without any reference to time or place, 2 = memory had some 
specificity but was not specific to one time and place, 3 = memory was specific to one time and 
place). The bars show the mean number of narratives given each score. MTL, six patients with 
hippocampal lesions or larger medial temporal lobe lesions; MTL+, three patients with medial 
temporal lobe lesions and significant additional damage to neocortex; CON, 26 controls 


Two points were awarded for a memory that had some specificity but was not specific 
in time and place and was therefore not recalled as a unique event (e.g., “I used to stay 
at my grandma's house on weekends”). One point was awarded for a vague reference 
(e.g., “I read a lot of books”). Zero points were given for no response or a generic 
response (e.g., “You can open and close a door”). 

The quality of the autobiographical memories of the two patient groups (MTL and 
MTL+) and 26 controls are illustrated in Fig. 8. The patients in the MTL group and 
controls were able to provide unique autobiographical memories (score of 3 points) 
in response to most of the 24 cue words (MTL patients, 21.6 memories; controls, 22.9 
memories). In contrast, the three patients in the MTL+ group provided a mean of only 
4.0 unique autobiographical memories in response to the 24 cue words. The majority 
of the memories provided by the MTL+ group were awarded just one point, suggesting 
that these patients were able to recall general information but had difficulty providing 
memories that were specific to time and place. 

For the volumetric analysis of brain images, two medial temporal lobe structures 
were defined for each hemisphere: the hippocampal region (hippocampus proper, 
dentate gyrus, and subicular complex) and the parahippocampal gyrus (perirhinal, 
entorhinal, and parahippocampal cortices). To obtain volumetric data for the neocor- 
tex, twelve regions of interest were defined, including the frontal lobes, lateral temporal 
lobes, parietal lobes, occipital lobes, insular cortex and fusiform gyrus (both left and 
right sides). Twelve controls were matched to the patients and also analyzed. 

In the MTL group, three of the patients (R.S., G.W., J.R.W.) were found to have 
substantial volume reductions within the hippocampal region but, with one exception, 
no reduction in the parahippocampal gyrus, the fusiform gyrus, the insular cortex, or 
the major lobes of the brain. The one exception was patient R.S., whose parietal lobes 
are unusually small. This finding likely reflects natural variation in the parietal lobe 
volume rather than damage to this brain region for the following reasons: 1) the parietal 
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lobes are highly variable in size (Raz et al. 2005); 2) no evidence of parietal lobe damage 
is apparent in R.S’s MRI scan; and 3) R.S. obtained normal scores on tests sensitive 
to parietal lobe function. Patients E.P. and G.P. have more extensive medial temporal 
lobe damage than the other patients in the MTL group. Specifically, the hippocampal 
region and parahippocampal gyrus are markedly reduced in volume bilaterally. The 
major lobes of the brain were of normal volume. 

In contrast to the patients in the MTL group, patients in the MTL+ group were found 
to have reduced volumes of medial temporal lobe structures and additional, significant 
reductions in the volume of one or more of the major lobes. Specifically, H.C. has 
reduced volumes of the frontal, parietal and occipital lobes. These reduced volumes 
are bilateral, although the volume reduction in H.C’s frontal lobe reached significance 
only in the right hemisphere. Patient P.H. was found to have reduced volume in the 
left frontal lobe and a similar, albeit not significant, reduction in the right frontal lobe. 
Lastly, patient G.T. has reduced volumes in the lateral temporal lobes bilaterally. 

The major finding was that patients with significant damage to neocortex were 
markedly impaired on tests of remote autobiographical memory, whereas patients 
with damage restricted to the medial temporal lobe performed normally. The three 
patients who had difficulty in recalling remote memories all had damage to areas of 
the cortex that have been implicated in remote memory. For example, patient G.T. had 
damage to the lateral temporal cortex, which when damaged is known to impair remote 
autobiographical memory (Graham and Hodges 1997). Further, patients H.C. and P.H. 
both had frontal lobe damage. The frontal lobe is known to be important for a variety 
of “executive” functions that are important for the strategic aspects of recall and for 
active or effortful reconstructive processes (Kopelman 2002). Furthermore, frontal lobe 
damage has been reported to impair autobiographical memory (Kopelman et al. 2003). 
Lastly, patient H.C. had damage to the occipital lobe. The occipital region has been 
associated with the retrieval of visual images, and damage to this region has also been 
associated with impaired autobiographical recollection (Rubin and Greenberg 1998). 

As described above, some memory-impaired patients have been reported to do 
poorly at recollecting autobiographical memory, and these impairments have been 
attributed to their medial temporal lobe damage (patient V.C., Cipolotti et al. 2001; 
patient H.M., Steinvorth et al. 2005; patient Y.K., Hirano et al. 2002). On the basis of 
the anatomical evidence considered above, it is important to also consider the pos- 
sibility that these patients have significant damage outside the medial temporal lobe. 
Another possibility is that the differences reported in different studies are due to 
important variations in test procedures. Indeed, it has been proposed that, until stan- 
dardized techniques are used across laboratories, differences in how remote memories 
are assessed could explain differences in findings (Rosenbaum et al. 2004). However, 
it should be noted that a simple, standardized procedure has been used to assess the 
autobiographical memories of many of the patients under study [the Autobiographical 
Memory Interview (AMI); Kopelman et al. 1989]. The AMI assesses autobiographical 
memory for childhood events using three questions (maximum score = 9 points). 

The crucial finding is that patients who reportedly do poorly at producing detailed 
remote autobiographical memories also perform poorly on the childhood portion 
of the AMI [patient Y.K. = 4/9 points (Hirano and Noguchi 1998); patient V.C. = 
1/9 points (Cipolotti et al. 2001)]. Notably, our patients in the MTL+ group, who 
have damage outside the medial temporal lobe, all failed to reach a maximum score 
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[H.C. = 7/9 points; G.T. = 0/9 points; P.H. = 0/9 points (see Bayley et al. 2005b)]. In 
contrast, our five patients in the MLT group who had damage limited mainly to the 
medial temporal lobe all achieved the maximum score of nine points on the childhood 
portion of the AMI. These results indicate that, even when a simple standardized test 
is used, one still finds differences among patients in their ability to recollect remote 
autobiographical memories. Accordingly, the different results recorded in the literature 
cannot be a function of differences in the test procedures that have been used. 

We suggest that whenever memory-impaired patients have significant difficulty 
in recollecting remote autobiographical memories, they will ultimately prove to have 
damage outside the medial temporal lobe. The ability to retrieve remote autobiograph- 
ical memories is supported by neocortex, including the frontal, lateral temporal and 
occipital lobes. It should be noted that these conclusions are consistent with findings 
from many studies using experimental animals. In these studies, after lesions to the 
hippocampus or related structures, information learned long before the onset of mem- 
ory impairment is spared relative to information learned recently (Squire et al. 2004). 
Animal studies have also demonstrated the increasing importance of the neocortex as 
memories grow older (Frankland et al. 2004; Maviel et al. 2004; Wiltgen et al. 2004; 
Frankland and Bontempi 2005). The present study demonstrates that human mem- 
ory follows a similar pattern: remote autobiographical memory is supported by the 
neocortex and is independent of the medial temporal lobe. 


Robust Habit Learning in Humans 


Habit memory is a form of nondeclarative memory that involves slowly acquired asso- 
ciations between stimuli and responses. In contrast to declarative memory, nondeclar- 
ative memory operates outside of awareness and is expressed through performance 
rather than conscious recollection. Declarative memory depends on the integrity of 
the medial temporal lobe. By contrast, habit memory is thought to be supported by 
the basal ganglia (Mishkin and Petri 1984). It has been difficult to study habit learning 
in humans because humans tend to spontaneously engage their declarative memory 
when performing memory tasks. In contrast, experimental animals readily and spon- 
taneously learn by habit, even when given a memory task that, in humans, engages 
declarative learning. 

The issue is nicely illustrated by the concurrent discrimination task (Hayes et al. 
1953; Correll and Scoville 1965). In this task, eight pairs of objects are typically presented 
five times per day, one pair at a time. One of the objects in each pair is arbitrarily 
designated as always correct, and the choice of that object produces a reward. Humans 
quickly learn this task using declarative memory and perform at approximately 90% 
correct after one or two training days. Amnesic patients are markedly impaired at 
learning the task (Squire et al. 1988; Hood et al. 1999). The findings are different 
in the monkey. First, normal monkeys learn the concurrent discrimination task only 
gradually and require several hundred trials to learn all eight pairs. Further, monkeys 
with either hippocampal lesions or large medial temporal lobe lesions acquire the task 
at anormal rate (Buffalo et al. 1998; Malamut et al. 1984; Teng et al. 2000). Lastly, in the 
monkey, damage to the caudate nucleus impairs performance on this task (Teng et al. 
2000; Fernandez-Ruiz et al. 2001). 
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One possibility is that habit learning is relatively underdeveloped in humans such 
that habit learning cannot support the learning of concurrent discrimination, even 
though the task is readily learned by animals as a habit. Alternatively, habit memory 
may be capable of supporting this kind of learning in humans, but habit learning may be 
overridden by a tendency to engage a declarative memory strategy. We explored these 
possibilities by giving the concurrent discrimination test to two profoundly amnesic 
patients, E.P. and G.P., and four matched controls (Bayley et al. 2005a). As described 
earlier, these two patients are virtually unable to acquire any new declarative knowledge 
(Stefanacci et al. 2000; Levy et al. 2004; Bayley and Squire 2005). 

The concurrent discrimination test consisted of eight pairs of junk objects (mis- 
cellaneous pieces of metal or plastic). Testing was given twice each week (40 trials 
per session), and participants were required to learn by trial and error which object 
in each pair was correct. Specifically, they were required to pick up one of the two 
objects and discover whether the word “correct” appeared under its cardboard base. 
Figure 9A shows that the controls learned the task quickly and, at the end of three 
test days, scored 95.3% correct on average. The patients were markedly impaired but 
eventually succeeded at learning. E.P. learned gradually at a nearly linear rate (linear 
trend: F(1,35) = 54.7, P < 0.001). After 36 test sessions (18 weeks), he performed at 
85.0% correct (Fig. 9B). G.P. showed a similar pattern of gradual learning (linear trend: 
F(1,27) = 13.6, P < 0.001) and after 28 test sessions (14 weeks) achieved a score of 92.5% 
correct (Fig. 9C). 

Before each session, the patients were asked to describe what they were about 
to do. Strikingly, neither patient could report any specific knowledge about the test. 
For example, E.P. was never able to say anything more specific than that he and the 
experimenter would have a conversation. And when told that the test involved objects, 
he typically suggested that he and the experimenter would discuss their use. Yet, in 
the later sessions, E.P. would conclude his comments, turn to the test, and obtain high 
scores. It is also notable that the comments offered by the patients during testing 
described an automatic kind of responding. For instance, right at the end of session 
34, E.P. was asked to describe his strategy for choosing the correct object. (He had 
just performed 80% correct). He stated, “It’s just up here [pointing to head] from the 
memory. It seems like it’s up there, and comes down and out”. 

One hallmark of habit memory is that it is inflexible and is thought to be depen- 
dent on the original training conditions for full expression (Mishkin and Petri 1984; 
Wise 1996). To test whether the knowledge acquired by the patients was inflexible, we 
administered a sorting task three to six days after the conclusion of formal training. 
For this task, all 16 objects were placed together on a table. Participants were told that 
some of the objects had been consistently designated as correct, and they were asked 
to sort the objects, placing the correct objects to one side and the other objects to 
the other side. Figure 9A shows that the controls performed well on the sorting task, 
scoring 95.3% correct. In contrast, E.P. and G.P. both failed the sorting task (56.3% 
and 50.0% correct, respectively, chance performance = 50%; Fig. 9B,C). Immediately 
afterwards, the original task was given to participants with instructions to verbalize 
their choices rather than to reach for the objects. All participants scored 90% or better. 
Seventeen days later, each patient was given the sorting task again, followed by 40 trials 
of the standard task (Fig. 9B,C). The results were the same as before. E.P. and G.P. failed 
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Fig. 9. Performance on the concurrent 
discrimination task. (A) Controls (CON) 
rapidly learned the task over three ses- 
sions and performed well on the sorting 
task three to six days later (gray bar). The 
black bar shows their performance im- 
mediately afterwards when they were re- 
quired to verbalize their choices rather than 
reach for objects. Brackets show S.E.M. (B) 
E.P. gradually learned the task across 18 
weeks. Five days later, he failed the sorting 
task (gray bar) but immediately afterwards, 
performed well in the standard task for- 
mat while verbalizing his responses (black 
bar). Seventeen days later, E.P. again failed 
the sorting task (gray bar) but performed 
well when the original conditions were re- 
established. (C) G. P. learned the task grad- 
ually during 14 weeks. He failed the sorting 
task on two different occasions, five days 
after training and again 17 days later. Like 
E.P., he performed well immediately after- 
wards when the original task format was re- 
instated (black bar: verbalizing; white bar: 
standard task) 


the sorting task (56.3% and 50.0% correct, respectively) but immediately afterwards 
succeeded at the standard task (97.5% and 90.0% correct, respectively). 

Taken together, this study illustrates that habit learning in humans can support 
tasks such as concurrent discrimination, which humans ordinarily learn as declarative 
memory. Indeed, the patients appear to learn the task rather in the way that monkeys 
learn. They improve gradually over many hundreds of trials. However, learning under 
these circumstances operates outside of awareness and independently of the medial 
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temporal lobe. What is learned is rigidly organized, and the expression of what has 
been learned depends on reinstating the original task format. 


Conclusion 


The studies reviewed in this chapter provide evidence for the distinction between 
declarative (conscious) and nondeclarative (unconscious) learning systems. In partic- 
ular, the results indicate that the medial temporal lobe is essential for the acquisition 
of new declarative memory. Thus, patients with nearly complete damage to the me- 
dial temporal lobe fail to acquire new declarative knowledge. Nevertheless, these same 
patients, and other patients who have damage limited to the hippocampal region, are 
able to recollect detailed autobiographical memories from their remote past. Study of 
other patients who fail tests of remote autobiographical memory strongly suggests that 
widespread areas of the neocortex are the permanent repositories of autobiographical 
memories. In contrast to these observations of declarative memory, nondeclarative 
memory operates unconsciously and independently of the medial temporal lobe. Even 
patients with virtually no capacity to acquire new declarative memories can nonethe- 
less acquire new information through habit learning. In sharp contrast to declarative 
memory, habit learning is slow, unconscious and rigidly organized. 
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Dynamics of Hippocampal-Cortical Interactions 
During Memory Consolidation: 
Insights from Functional Brain Imaging 


Bruno Bontempi! and Thomas P. Durkin! 


Summary 


Both clinical studies and experiments in animals have provided evidence for the ex- 
istence of a temporally graded retrograde amnesia following lesions of the medial 
temporal lobe, including the hippocampus. This form of amnesia, which is character- 
ized by a loss of memory for recent events acquired prior to the onset of amnesia while 
more remote memories are preserved, is one of the major arguments for the existence 
of a consolidation process necessary for stable, long-term memory formation. It is 
now well established that the formation of declarative memory (memories for facts 
and events) involves changes in synaptic plasticity within the medial temporal lobe. 
However, our group and others have demonstrated that the hippocampus has only 
a time-limited role in long-term memory storage of certain types of information, such 
that extrahippocampal structures, namely cortical regions, eventually become capable 
of supporting the retrieval of remote memories independently. In other words, the hip- 
pocampus does not store remote memories, yet what happens beyond the hippocampus 
remains unclear. This issue has been the subject of intense investigation and debate 
in the field of cognitive neuroscience, but to date, no convincing evidence as to the 
identity, mechanisms and putative interactions between memory systems underlying 
remote memory storage and retrieval have clearly emerged. To address this issue, we 
have conducted brain imaging experiments using (!4C)2-deoxyglucose mapping and 
analyses of changes in the expression of activity-dependent genes (c-fos and Zif268) in 
mice submitted to recent and remote spatial memory testing. Our findings show that 
memory processing and consolidation require a time-dependent hippocampal-cortical 
dialogue, ultimately enabling structured cortical networks to mediate recall and use of 
cortically stored remote memories independently. However, the cortex does not simply 
serve as a passive storage site but may also actively integrate new memories depending 
on the organization and status of pre-existing knowledge. The prefrontal cortex in 
particular appears to play a crucial role in integrating and binding information from 
distributed cortical networks and in modulating the level of hippocampal activation 
during memory recall. These findings are discussed in the context of current models 
of memory consolidation and in light of data from the recent literature in humans and 
animals. 
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Introduction 


One of the most astounding features of human memory is the fact that its capac- 
ity for persistence encompasses a time-scale ranging from milliseconds to the many 
decades that cover the life span of an individual (Squire and Kandel 2000). The idea 
that memory traces are not acquired in their definitive state at the moment of initial 
acquisition but rather undergo a gradual process of stabilization and consolidation 
over time has been a central concept in cognitive neuroscience for several decades and 
finds its roots in the clinical observations made by the French psychologist Theodule 
Ribot as early as 1881. Ribot was the first to relate memory loss after a brain insult 
to the age of the memory and to point out that recently acquired information was 
typically more impaired than remote memories. This dissociation led him to sug- 
gest the probable existence of a time-dependent process of memory reorganization, 
which later became known as Ribot’s gradient. Building on major neuroanatomical 
and neuropsychological advances during the twentieth century, further investigations 
of memory function in brain-damaged patients and animals have provided remarkable 
insights into the brain regions underlying memory processes and into how recent and 
remote memories are organized and stored in the brain (McGaugh 2000; Dudai 2004; 
Squire et al. 2004; Frankland and Bontempi 2005; Moscovitch et al. 2006). In humans, 
the medial temporal lobe, which includes the hippocampus, has been shown to play 
a crucial role in processing memory for facts and events (i.e., “declarative memories”; 
Eichenbaum 2000, 2004). As these memories mature over time, they are thought to 
become progressively independent of the hippocampus and increasingly dependent 
on other brain regions such as the cortex (Squire and Alvarez 1995; McClelland et al. 
1995; Squire et al. 2004). However, the contribution of these other brain regions and 
the neuronal mechanisms underlying the consolidation process have remained elu- 
sive, and this issue constitutes a major research topic in the field. Recent studies have 
begun to shed light on the dynamics of hippocampal-cortical interactions that enables 
recently acquired memories to be transformed into enduring memories in distributed 
cortical networks. In this chapter, we examine these recent advances, starting with 
a brief history of research into memory consolidation that has led to the proposal of 
two main, but divergent, theories concerning the role of the hippocampus in memory 
consolidation. Experimental evidence in support of each of these theories is reviewed 
in light of the existing literature and also addressed by recent functional brain imaging 
studies conducted by our group using mice. The respective roles of hippocampal and 
cortical brain regions in processing recently versus remotely acquired information and 
the mechanisms involved in cortical storage are also briefly discussed, with a partic- 
ular focus on the role of the prefrontal cortex, which we suggest progressively exerts 
a privileged role during the storage, organization and retrieval of remote memories. 


Evolution of the concept of memory consolidation 


A brief history 


The first authors to have used the term “consolidation” were Miiller and Pilzecker 
(1900). They formulated the hypothesis that information processing within the brain 
remains sensitive to interference during a certain time-period up until the point when 
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the memory traces become “consolidated”, and thereby resistant to disruption. Impor- 
tantly, these same authors also suggested that long-term memory storage of information 
(in their study, verbal material) would be mediated via the temporary persistence of 
“certain physiological processes” - nowadays commonly referred to as patterns of neu- 
ronal activation - induced by the acquisition of that information (see also Lechner et 
al. 1999). In 1903, Burnham, basing his observations on cases of retrograde amnesia, 
that is, a loss of information that was acquired prior to the onset of the amnestic event, 
described memory formation and consolidation as requiring a time-dependent pro- 
cess of repetition and association, and also included the notion of a reorganization and 
modification of the neuronal substrates underlying the storage of recently acquired 
information in order for by to integrate into long-term memory. The confirmation by 
Woodworth (1929) of retroactive interference (disruption of retention of a learning 
episode A by the later acquisition of another similar learning episode B, which Miiller 
and Pilzecker initially called “retroactive inhibition”) also provided additional support 
for the existence of the consolidation phenomenon and suggested that the neuronal 
processes engaged in the brain by a learning episode persist long after the end of the 
learning session. Karl Lashley was another pioneer in the search for the memory trace 
classically referred to as the “engram”. By performing cortical lesions of different sizes 
in rats confronting cognitive challenges in complex mazes, Lashley (1950) concluded 
that memories are not localized but are rather widely distributed throughout the cortex. 
At a more mechanistic level, Hebb (1949) and Gerard (1949) further enforced the idea 
that persistent reverberation of activity within neuronal networks activated at the time 
of encoding was likely to be crucial for converting short-term memory into long-term 
memory. 

One of the most convincing arguments in favor of a consolidation process resides 
in the temporal aspects of retrograde amnesia. This form of amnesia has been typically 
reported to be temporally graded and characterized by a loss of recent memories, but 
with a relative sparing of more remote ones, just as initially observed by Ribot. It was 
Scoville and Milner (1957) who opened a new era in the search for the anatomical 
correlates of temporally graded retrograde amnesia. Patient H.M. was diagnosed with 
severe intractable epilepsy whose focus was located within the medial temporal lobe. 
Scoville, a neurosurgeon, therefore decided to excise a large portion of H.M?s me- 
dial temporal lobe bilaterally. While effective in abolishing his epileptic seizures, this 
surgery unexpectedly produced a severe, temporally graded retrograde amnesia that 
extended back approximately 11 years (Corkin 2002). However, H.M.’s remote memo- 
ries acquired during his youth remained unaffected, as did his intellectual capabilities. 
The memory profile exhibited by H.M. and other similar case studies was interpreted 
as evidence for a time-limited role of the hippocampus and related structures of the 
medial temporal lobe in memory storage (Rempel-Clower et al. 1996). In short, once 
consolidated, memories must be stored elsewhere in the brain. 

Numerous animal studies were later successful in reproducing the temporally 
graded retrograde amnesia observed in humans. These studies particularly focused on 
attempting to define precisely the duration of the memory consolidation phase. Histor- 
ically, the most classically used protocol for this type of study consisted in evaluating 
the duration of sensitivity to amnesia by applying amnestic treatments at different time 
periods after the initial acquisition of the information and then testing for memory 
retrieval. The potential for enhancing memory by applying specific drug treatments 
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during the course of the consolidation phase further stimulated interest in this type 
of experiment (McGaugh and Herz 1972; McGaugh 1989). Taking advantage of the 
development of refined animal models of human memory in primates and rodents 
and of the use of innovative lesion approaches, many researchers then succeeded in 
identifying the anatomical components of the medial temporal lobe memory system. 
Brain structures that constitute this system are anatomically connected and are con- 
served across species such as humans, non-human primates and rodents (Amaral and 
Witter 1989; Lavenex and Amaral 2000; Suzuki and Amaral 2003). They include the 
hippocampus proper (CA1, CA3, dentate gyrus) and the subicular complex, as well as 
adjacent entorhinal, perirhinal and parahippocampal cortices. 

It rapidly became clear that the medial temporal lobe system was specifically 
important for declarative memory of the kind affected in patient H.M (Milner el 
al. 1998). Findings from all of these studies were not only reminiscent of Miiller and 
Pilzecker’s original proposal that the consolidation process represented the progressive 
post-acquisition stabilization of memories, but also provided a theoretical basis to 
differentiale at least three forms of memory that can be distinguished on the basis 
of their timescales: a labile and vulnerable form, named short-term memory, that 
processes information from seconds to minutes; a more stabilized form, known as 
long-term memory, where information can be stored from minutes to several days; and 
an enduring form, termed remote memory, that ensures storage for extended periods of 
time lasting from several weeks to a lifetime (Fig. 1). As detailed in the next paragraph, 
the different kinetics for short-term, long-term and remote memory directly reflected 
on the very concept of consolidation. These have led to what is currently referred to 
as “synaptic consolidation”, a rapid process of experience-related synaptic changes, 
and system consolidation, which involves a much slower and elaborated neuronal 
reorganization at the level of entire hippocampal-cortical neuronal circuits (McGaugh 
2000). 


Two main phases in memory consolidation 


Neurobiologists distinguish two phases of memory consolidation on the basis of their 
specific temporal kinetics. Synaptic consolidation occurs within minutes to hours 
following a learning experience and is triggered by a series of cellular and molecu- 
lar cascades that include release of neurotransmitters, activation of central receptors, 
production of second messengers, activation of various transcription factors and as- 
sociated gene signaling pathways. Ultimately, these now well-characterized metabolic 
events lead to the synthesis of proteins that are directly responsible for the occur- 
rence of morphological changes within localized neuronal networks, such as changes 
in synaptic strength and restructuring of existing connections as well as the growth 
of novel ones (Squire and Kandel 2000; Kandel 2004). Consolidation can also occur at 
a system level. In this case, system consolidation is a much slower process that can last 
for weeks to months, even up to several decades depending on the species. This process 
is characterized by a gradual reorganization of the brain regions, namely hippocampal 
and cortical circuits that support memory processing (Dudai 2004; Frankland and Bon- 
tempi 2005). The concept of system consolidation was directly inspired from studies 
of retrograde amnesia gradients that indicated that the hippocampus cannot be the 
permanent repository of long-term memory (Zola Morgan and Squire 1990). This idea 
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Fig. 1. Different stages in memory formation. Experimental and clinical evidence point to the 
existence of three main forms of independent memory that undergo parallel processing and 
consolidation, albeit on different temporal kinetics ranging from only seconds to several months 
or even a lifetime (adapted from McGaugh 2000). Consolidation is defined as the generic process 
enabling memory traces to gradually acquire stability over time. STM, short-term memory; LTM, 
long-term memory 


actually forms the central tenet of most contemporary views of system consolidation: 
the hippocampus acts as a temporary consolidation organizing device, but storage 
and retrieval of enduring memories depend on broadly distributed extrahippocampal 
networks, presumably located in the cortex. 


The standard model of memory consolidation versus the multiple trace theory: 
two divergent views of the same process 


In the so-called standard model of memory consolidation (Fig. 2), the hippocampus 
is believed to rapidly integrate and bind together information transmitted from dis- 
tributed cortical networks that support the various features of a whole experience in 
order to form a coherent memory trace. Consolidation of this new memory trace at the 
cortical level would then occur slowly via repeated reactivation of hippocampal-cortical 
networksr to progressively increase the strength and stability of cortical-cortical con- 
nections. Over time, as memories mature, the role of the hippocampus would gradually 
diminish, leaving extrahippocampal regions, presumably cortical areas, to become in- 
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Fig. 2. The standard model of memory consolidation. Perceptual, motor and cognitive informa- 
tion is initially processed by several specialized primary and associative cortical areas represented 
by cortical modules. The medial temporal lobe, including the hippocampus and related structures, 
integrates the various features of an experience and fuses them rapidly into a coherent memory 
trace. System consolidation then occurs slowly over time and involves a hippocampal-cortical 
dialogue to gradually strengthen cortical-cortical connections. This process involves not only 
the strengthening of existing cortical-cortical connections but also the creation of new connec- 
tions between previously unconnected neurons via repeated activation of hippocampal-cortical 
networks during periods of quiet wakefulness or sleep. As cortical memories mature and acquire 
stability, the functional role of the hippocampus gradually diminishes, enabling structured cor- 
tical networks to ensure retrieval of remote memories independently. A key feature of this model 
is that changes in the strength of hippocampal-cortical connections are fast but short-lasting 
whereas changes within cortical-cortical connections are slow but long-lasting. (Adapted from 
Frankland and Bontempi 2005) 


dependently capable of sustaining permanent memories and mediate their retrieval 
(Squire and Alvarez 1995; McClelland et al. 1995). 

An alternative and challenging view is offered by the multiple trace theory, which 
posits that the hippocampus retains a permanent role in memory storage and retrieval 
as long as memories exist (Nadel and Moscovitch 1997). This view is supported by three 
main lines of clinical and experimental observations that cannot be accounted for by the 
standard model of memory consolidation. First, retrograde amnesia can, in some cases, 
be ungraded (i.e., ‘flat?), wherein both recent and remote memories have been reported 
to be similarly impaired (Cipolotti et al. 2001). Second, certain retrograde amnesia 
gradients have been reported to last for decades or up to almost the entire human life 
span in some amnesic patients, thus raising the question of the ethological value of such 
an extended period of consolidation. Third, the observation of a retrograde amnesia 
gradient and its temporal extent may depend on the type of declarative memory to be 
consolidated (episodic, semantic or spatial; Moscovitch et al. 2006). The main features 
of the multiple trace theory are as follows. Each consciously experienced event would 
consist of a cohesive hippocampal-cortical ensemble. Each time that this particular 
event is recalled, it would be recreated and recoded in the form of multiple and 
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stronger related memory traces dispersed over larger areas of hippocampal-cortical 
networks. Therefore, the relative sparing of remote memories in amnesic patients would 
be a function of the extent of hippocampal damage, with limited damage producing 
temporally graded retrograde amnesia and extensive lesions, resulting in a flat gradient 
for retrograde amnesia. Nadel and Moscovitch (1997, but for review, see also Moscovitch 
et al. 2006) further posit that the recurrent creation of multiple hippocampal-cortical 
traces would predominantly favor the integration of information with preexisting 
knowledge to form old semantic memories (memory for general knowledge of facts) 
whose retrieval could possibly occur without the contribution of the medial temporal 
lobe memory system. However, in the case of remotely acquired episodic memories, 
which are autobiographical and richly detailed in nature, these authors postulated that 
retrieval would always require the contribution of hippocampal-cortical networks. 

It must be noted that the results of some case studies, for instance, patient E.P., 
who suffered large lesions of the medial temporal lobe following an episode of viral 
encephalitis, do not support the multiple trace theory (Teng and Squire 1999). Despite 
extensive hippocampal damage, this patient had excellent autobiographical memories 
from his youth and could accurately recall the spatial layout of the area where he 
grew up more than 50 years earlier. Even more astonishing was the ability of E.P. to 
mentally navigate and construct novel spatial routes to access specific locations of his 
past neighborhood, thus pointing to the existence of extrahippocampal spatial maps 
acquired long ago. In accordance with this case study, temporal gradients have been 
reported in animals even after complete hippocampal lesions. These discrepancies, 
especially the issue as to whether recalled memories in hippocampal patients such 
as E.P. or K.C. (Rosenbaum et al. 2000) are as vivid and richly detailed as in healthy 
patients, continue to be hotly debated (Bayley et al. 2003; Squire et al. 2004; Moscovitch 
et al. 2006). 


Time-dependent hippocampal-cortical interactions during 
system consolidation as revealed by brain imaging approaches 


In humans 


As outlined in the brief history above, temporally graded retrograde amnesia has been 
reported in humans and these case studies are well documented (Squire et al. 2004; 
Moscovitch et al. 2006). One striking difference with animal studies concerns the length 
of the gradient. Most often, amnesia extends back across a period of several years and, 
in some cases, even several decades. Both remote semantic memory for facts and 
remote episodic memory for autobiographical events appear to be spared, although 
some patients with hippocampal damage have been diagnosed as having difficulties in 
recollecting detailed episodes from their early life (Rosenbaum et al. 2000), possibly 
because of extensive damage outside the medial temporal lobe (Squire et al. 2004). Re- 
sults from positron emission tomography (PET) and functional brain imaging (fMRI) 
studies in humans have proven to be controversial, in part because of technical dif- 
ficulties in recording activity across subjects accurately and in designing appropriate 
comparative baseline control tasks aimed at isolating brain activity associated with 
non-mnemonic aspects of the testing procedure. However, Haist and colleagues (2001) 
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succeeded showing in the existence of a time-limited role of the medial temporal lobe 
in remote memory in healthy adults. Subjects were asked to identify faces of people 
that were famous during a particular decade from the 1940s to the 1990s. Robust hip- 
pocampal activation, as revealed by fMRI, was observed only during identification of 
faces that achieved fame recently, during the 1990s, whereas the entorhinal cortex, 
which connects the hippocampus to other widespread cortical regions, appeared to 
have a more prolonged role in supporting remote memory over decades (Eichenbaum 
2001). A recent fMRI study by Takashima and colleagues appears to be even more con- 
sistent with the imaging data obtained in animals and the observed time course of the 
retrograde amnesia gradient (Takashima et al. 2006). In contrast to the study by Haist, 
the authors were successful in studying remote memory in a prospective manner, as in 
the experimental designs classically used in animals. Healthy adult subjects were asked 
to encode and memorize a series of photographs depicting various natural landscapes. 
Recognition of these photographs was then probed during a testing session that oc- 
curred the same day, and 1 day, 1 month or 3 months later. Results show a striking 
correspondence with the imaging results obtained in animals. Not only was confident 
recall of remotely acquired pictures independent of the hippocampus and dependent 
on the medial prefrontal cortex, but this dramatic time-dependent reorganization also 
occurred at a rapid rate, of a magnitude comparable to that observed in animal studies 
of retrograde amnesia (Frankland and Bontempi 2006). 


In animals 


When it comes to studying temporally graded retrograde amnesia, animal models 
offer a certain number of advantages over human case studies (Squire 1992). First, 
the content and age of memories can be rigorously assessed across time points in 
a prospective manner, two parameters that are difficult to control for in humans, where 
studies rely almost exclusively on retrospective questionnaires. Second, the location 
and extent of lesions are strictly circumscribed to the brain area of interest. As reviewed 
by Frankland and Bontempi (2005), more than 30 studies have investigated the impact 
of disrupting hippocampal function on the retrieval of recent and remote memory. 
Close analysis reveals that the length of the retrograde amnesia gradient seems to be 
a function of several factors, including the species, the type and complexity of the 
memory paradigm, the amount of training and the type, extent and location of the 
brain lesion. The typical pattern of memory impairment induced by hippocampal 
lesions as a function of the age of the memories that emerges from all these studies is 
summarized in Fig. 3. 

Although temporally graded retrograde amnesia has been shown consistently 
across a variety of species, both in invertebrates (e.g., bees or flies) and vertebrates 
(e.g., nonhuman primates, rabbits, rodents, or pigeons), this phenomenon has been 
demonstrated almost exclusively by performing various lesions of the medial temporal 
lobe and related structures. Unfortunately, these lesion approaches do not provide any 
insights into the status of memory storage in the hippocampus at the time of memory 
retrieval. In other words, one fundamental question remains: what would have been the 
contribution of the hippocampus to remote memory retrieval had it been functionally 
intact at the time of retention testing? One powerful way to address this question is by 
using functional brain imaging. Over the past two decades considerable progress has 
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Fig. 3. Retrograde amnesia is typically temporally graded. Damage to the medial temporal 
lobe, including the hippocampus (HPC), impairs recent memory while sparing remote memory, 
pointing to a time-limited role of the hippocampus in memory storage. The period of time covered 
by the retrograde amnesia (RA) gradient corresponds to the duration of system consolidation that 
renders memory traces hippocampus-independent. Hippocampal lesions also induce incapacity 
to form any new enduring memories. AA, anterograde amnesia 


been made and techniques such as PET, and more recently fMRI, have made it possible 
to visualize the human brain at work. Similar techniques are also available in small ani- 
mals. One such technique is the ('4C)2-deoxyglucose autoradiographic method, which 
can be readily used in freely moving animals confronting various cognitive challenges 
(Sokoloff et al. 1977). Given that glucose is the sole energy source for neurons in normal 
physiological conditions, the measure of its rate of consumption in a given brain re- 
gion enables to evaluate the level of functional implication of that region in a particular 
experimental situation. The use of the radioactive analogue 2-deoxyglucose, which 
is captured by active neurons and trapped inside them, allows to obtain quantitative 
assessments of regional levels of cerebral glucose utilization, thereby providing instant 
snapshots of changes in neuronal activity throughout the brain. We adapted this tech- 
nique in mice submitted spatial discrimination testing in the eight-arm radial maze, 
wherein animals had to locate a set of three constantly baited arms using spatial cues 
scattered throughout the room containing the maze. Our main goal was to visualize 
memory reorganization during system consolidation in an intact (non-lesioned) brain 
and to determine whether retrieval of remotely acquired spatial information would 
be associated with a decreased functional involvement of the hippocampal region as 
compared to the situation where subjects were recalling recent information (Bontempi 
et al. 1999). Following acquisition of the task, animals were submitted to retention 
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testing either five days (recent memory) or 25 days (remote memory) later. The choice 
of these retention delays for testing recent and remote memory was made on the basis 
of previous observations concerning the length of the retrograde amnesia gradient 
observed in mice following restricted lesions to the entorhinal cortex (Cho et al. 1993). 
Intravenous injections of 2-deoxyglucose were carried out 1 min prior to each reten- 
tion session. Brains were collected immediately after the 35-min retention session and 
prepared for autoradiography. As shown in Fig. 4, region-specific changes in metabolic 
activity occur as memories matured over time. In accordance with the predictions of 
the standard model of memory consolidation, recall of remote as compared to recent 
memories was associated with a significant reduction of metabolic activity within the 
different regions of the hippocampal formation, including the hippocampus proper 
(CA1, CA3 and dentate gyrus) as well as the entorhinal cortex. This functional dis- 
engagement of the hippocampal formation was associated with increased metabolic 
activity in frontal, anterior cingulate and temporal cortices, indicating a greater level 
of recruitment of these regions in remote memory retrieval (Fig. 4). This particular 
profile of neuronal reorganization was specific to memories acquired 25 days earlier, 
since introducing a change in reward contingencies (exposure to a different set of baited 
arms using the same maze as initial acquisition or placing the maze in a new room 
with a different spatial layout) resulted in hippocampal reactivation, indicating that 
this region is required to encode new information and to maintain access to recently 
acquired information until further corticalization (Bontempi et al. 1999). 

More recently, we conducted complementary imaging experiments at a more cel- 
lular level by mapping the expression of activity-dependent genes such as c-fos and 
zif268 in mice submitted to a spatial discrimination task in a five-arm maze designed 
in our laboratory (Durkin et al. 2000). c-fos and zif268 are two inducible immediate 
early genes that are required for synaptic plasticity and memory formation and whose 
expression is routinely used as an index of neuronal activation (Jones et al. 2001; Hall 
et al. 2001; Fleischmann et al. 2003). Groups of mice were trained to locate the invariant 
spatial position of one baited target arm in the five-arm maze (criterion of 80% correct 
response) and subsequently tested 1 day (recent memory) or 30 days after initial acqui- 
sition (Fig. 5A). To isolate gene expression associated with the nonmnemonic aspects 
of the spatial testing procedure (arousal, locomotor activity, etc.), we generated paired 
control mice that were exposed to the maze but were not confronted to any arm choice 
during initial training or retention testing. For this group, only one target arm at a time 
was opened and this arm was either reinforced or not as a function of the response 
(correct or incorrect) of the paired experimental subjects. Brains were dissected 90 min 
after the end of a single trial of retention testing to prevent any reacquisition processes 
and were prepared for immunocytochemistry (Maviel et al. 2004). Again, a converging 
and consistent pattern of neuronal reorganization emerged: retrieval of recent memory 
engaged the hippocampus whereas retrieval of remote memories was associated with 
increased neuronal activity in widespread cortical regions such as the prefrontal, ante- 
rior cingulate and retrosplenial cortices (Fig. 5B; Fig. 6A). Reorganization of neuronal 
activation also occurred at a subregional level across cortical layers. In the parietal 
cortex, neuronal activation actually shifted from the deep cortical layers V-VI to more 
superficial layers II-III and IV (not shown, but see Maviel et al. 2004). Such laminar re- 
organization was consistent with the formation of cortical-cortical assemblies possibly 
constituting major sites of storage of encoded information, including the establishment 
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Fig. 4. System consolidation as revealed by functional brain imaging using the (!4C)2- 
deoxyglucose autoradiographic technique in mice. Color-coded autoradiographs of coronal sec- 
tions of mice tested for either recent memory (5 days, left), remote memory (25 days, center) or 
remote memory in the new context (30 days, right) following initial training in the eight-arm 
radial maze.The lower section of each panel shows a magnified view of the dorsal hippocampus 
(dHPC) that includes the CA1, CA3 and dentate gyrus (DG) regions. Lengthening the retention 
interval from 5 to 25 days resulted in a significant decrease in metabolic activity in the hippocam- 
pus and a concomitant increase in activity in several cortical regions, such as the frontal (FC) 
and anterior cingulate (aCC) cortices. These data point to a time-limited role of the hippocampus 
in remote memory storage. Over time, as memories mature, distributed cortical areas become 
capable of mediating retrieval of remote memories independently. However, when mice are con- 
fronted with a different context at a remote time point, the hippocampus is re-engaged, indicating 
that this brain region is required to encode novel information. PC, parietalcortex; pCC, posterior 
cingulate cortex. Scale bar, 2 mm. (Adapted from Bontempi et al. 1999) 


of extrahippocampal maps (Save and Poucet 2000). The parietal cortex may also con- 
stitute a storage site for very familiar environments, as recently suggested by a study 
using rats reared in a complex spatial environment. Despite subsequent extensive hip- 
pocampal damage, these animals were able to remember spatial representations of 
their environment (Winocur et al. 2005). 

Overall, the recruitment of these different cortical regions may reflect their in- 
creasing roles in memory storage, effortful recall, performance monitoring and use 
of consolidated remote memories (Buckner et al. 1999; Markowitsch 1995; Piefke et 
al. 2003; Ridderinkhof et al. 2004). However, since imaging data are only correlative, 
we next adopted an invasive approach that consisted of transiently silencing neuronal 
activity in the discrete brain regions identified on the basis of our previous imaging 
data. This approach was chosen over classical irreversible lesions to minimize com- 
pensatory phenomena within memory systems (Bures and Buresova 1990). Lidocaine, 
an anesthetic that blocks sodium channels and abolishes action potentials, was in- 
fused immediately prior to each retention session in specific brain regions such as the 
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Fig.5. Time-dependent reorganization of neuronal activity in hippocampal-cortical circuits dur- 
ing system consolidation. (A) Testing procedure in the five-arm maze. Mice were first submitted to 
spatial discrimination testing in the five-arm radial maze, wherein they were trained to locate and 
remember the spatial position of one target, constantly baited arm. Following 10 days of training 
to acquisition criterion (80% correct responses), memory performance was then assessed either 
1 day (recent test) or 30 days (remote test) later using single retention trial to prevent reacqui- 
sition phenomena. Animals underwent intracardiac perfusions 90 min after retention testing. 
Their brains were collected and prepared for visible immunocytochemistry. (B) Cellular expres- 
sion of activity-dependent genes. Zif628 counts relative to paired controls (100% baseline) are 
shown in the different regions of the hippocampal formation (dorsal and ventral hippocampus 
and interconnected entorhinal cortex) and in cortical regions (prefrontal, anterior cingulate and 
retrosplenial cortices) following recent (1 day) and remote (30 days) memory testing 


hippocampus or one of the different cortical areas. Results corroborated the imaging 
data by showing that recall of recently acquired information is initially dependent 
on hippocampal-cortical networks (hippocampus and posterior cingulate cortex) and 
then becomes progressively more dependent on distributed cortical-cortical networks 
(prefrontal and anterior cingulate cortices; Maviel et al. 2004). A similar pattern of time- 
dependent reorganization among hippocampal and cortical regions was observed in 
animals processing recent and remote contextual fear memories (Frankland et al. 2004). 

What are the neuronal mechanisms involved in remote memory storage? While 
multidisciplinary approaches in animals have contributed to the elucidation of many of 
the cellular and molecular mechanisms underlying memory processing in hippocampal 
networks during synaptic consolidation, much less is known about how the dynam- 
ics of hippocampal-cortical interactions allows these memories to be transformed into 
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Fig. 6. Wiring plasticity in the cortex underlies remote memory storage. (A) Representative 
photomicrographs showing increased Zif268 staining in the anterior cingulate cortex (aCC) on 
day 30 as compared to day 1. (B) The expression of growth-associated protein 43 (GAP-43), 
a presynaptic protein that controls axon growth and sprouting and is considered to be a marker 
of newly formed synapses, was examined in the aCC of animals tested either 1 day (recent 
memory) or 30 days (remote memory) after spatial discrimination in a five-arm maze. Cortical 
consolidation of the spatial position of the baited arm of the maze was accompanied by an increase 
in GAP-43 staining on day 30 as compared with day 1. Nuclei appear in blue (DAPI staining), 
fluorescent GAP-43 staining is green. This finding supports the concept that cortical networks 
undergo structural changes as memories mature over time. The addition of novel synapses may 
have contributed, at least in part, to increasing the extent and complexity of neuronal networks, 
as shown by the increased number of Zif268-positive nuclei in the aCC. Scale bars: (A) 100 um, 
(B) 10 um. (Adapted from Maviel et al. 2004) 


remote memories in cortical networks during the course of system consolidation. Mem- 
ory reactivation remains the core mechanism in most models of system consolidation. 
Repeated activation of hippocampal networks is thought to reinstate, in a coordinated 
and synchronized manner, activity in different cortical networks. Reactivation can be 
induced either during “online” states of conscious recollection or quiet wakefulness or 
during awake periods immediately after spatial experience (Foster and Wilson 2006). 
Replay of neuronal patterns of activity associated with a learning experience also hap- 
pens during “offline” states, such as sleep. Marr (1971) was among the first to suggest 
that offline coordinated replay of previous waking patterns of neural activity during 
sleep may drive consolidation and ensure the gradual stabilization of cortical memory 
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traces. A role for sleep in memory consolidation is now well documented. Although 
exceptions exist, a gain in retention performance after brief naps or overnight sleep 
has been reported (Hoffman and McNaugton 2002; Mednick et al. 2002; Stickgold et al. 
2000; Wagner et al. 2004; Walker et al. 2002). Conversely, sleep deprivation can inter- 
fere and even block memory consolidation. Particularly interesting is the observation 
that beneficial effects of sleep appear to be dependent on the form of memory. While 
consolidation enhancement of procedural and motor learning have been consistently 
reported, the data are less clear with regard to declarative memory, raising the possibil- 
ity that memory consolidation may involve several steps, each being driven by specific 
phases of sleep such as slow-wave sleep (SWS) or rapid eye movement sleep (REM), 
during which dreaming occurs. Sleep is characterized by a structured combination of 
neuronal oscillation that is region-specific. In the hippocampus, SWS exhibits high- 
frequency network oscillations known as “ripples,” whereas neocortical SWS activity 
is organized into low-frequency oscillations referred to as spindles. Importantly, hip- 
pocampal ripple activity has been shown to occur in temporal correlation with cortical 
spindle activity. Such coordinated replay of experience-dependent activity could thus 
represent one important mechanism that drives not only stabilization and refinement 
of memory traces in the cortex but also the updating of pre-existing cortical memories 
(Siapas and Wilson 1998; Ribeiro and Nicolelis 2004). Temporally structured replay 
of waking patterns of hippocampal ensemble activity associated with earlier learning 
is also subsequently observed during REM sleep, supporting the possibility that this 
particular phase of sleep contributes to memory trace reactivation (Louie and Wilson 
2001; Peigneux et al. 2004). 

At the molecular level, sleep has been associated with the up-regulation of around 
100 known genes (and about 400 as yet unidentified ones), independently of the phases 
of the circadian rhythm (Cirelli 2004). It is reasonable to assume that at least some 
of these genes, either directly or indirectly via triggering of specific signaling path- 
ways, mediate the changes in synaptic plasticity within distributed cortical networks. 
Two main forms of plasticity are thought to coexist within hippocampal-cortical net- 
works (Chklovskii et al. 2004). In the hippocampus, rapid encoding predominantly 
involves “weight plasticity,” that is changes in the strength between existing connec- 
tions via long-term potentiation (LTP) and long-term depression (LTD) mechanisms 
that modulate synaptic efficacy (Bliss and Collingridge 1993; Frey and Morris 1998; 
Laroche et al. 2000). In the cortex, however, where network reorganization occurs 
much more slowly, an additional form of plasticity may involve the creation of novel 
synapses between previously unconnected neurons, a process termed synaptogenesis 
(Benowitz and Routtenberg 1997). It is this form of “wiring plasticity” that we ob- 
served in our brain imaging studies in mice (Fig. 6B). Long-term storage of spatial 
and contextual information was accompanied by the synthesis of new synapses in 
specific areas of the prefrontal and anterior cingulate cortices, thus raising the pos- 
sibility that wiring plasticity contributed significantly to changing the organization 
and architecture of distributed cortical networks supporting remote memory storage 
(Frankland et al. 2004; Maviel et al. 2004). Such morphological changes appear to be 
crucial, as a-calcium/calmodulin-dependent protein kinase II mutant mice exhibiting 
normal hippocampal plasticity, but lacking cortical wiring plasticity, are unable to 
form enduring memories (Frankland et al. 2001, 2004). Since plasticity phenomena 
occur at different rates in hippocampal and cortical networks, the hippocampus has 
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been conceptualized as a fast learning area whereas the cortex would be a slow learner 
(McClelland et al. 1995). Although the mechanisms supporting this apparent division 
of labor are still unclear, it could help prevent catastrophic interference as emphasized 
in current connectionist models, protect existing cortical memories from being erased 
and ensure constant updating of these cortical memories. 


A privileged role for the prefrontal cortex 
in the processing of remote memory 


Our imaging studies, coupled to region-specific inactivations, identified the prefrontal 
cortex as playing a privileged role in processing remote memory. Classically, the pre- 
frontal lobe has been implicated in mediating temporal sequencing, effortful monitor- 
ing and response selection during memory retrieval (Ungerleider 1995; Ridderinkhof et 
al. 2004). Forgetting, which may reflect a partially degraded memory trace, is inevitably 
associated with remote memory storage; in our imaging studies, it is conceivable that 
mice had to deploy more effort to access and retrieve remotely acquired as compared 
to recently acquired information. Our findings point to additional roles played by 
the prefrontal cortex. Anatomically, the prefrontal cortex consists of several highly 
interconnected regions, including the prelimbic, infralimbic and anterior cingulate 
areas. These regions are reciprocally connected to sensory, motor and limbic cortices 
(Uylings et al. 2003) and are therefore ideally placed to integrate complex information 
from various sensory sources. This potential for integration has led us to suggest that 
the ability of the prefrontal cortex to process remote memories might mirror that of 
the hippocampus in the processing of recent information. At the time of encoding, 
the hippocampus is thought to integrate information from various independent cor- 
tical modules that represent the various features of an experience and then to rapidly 
fuse these features into a coherent memory trace (Eichenbaum 2004). As memories 
become gradually consolidated in the cortex through strengthening of cortical-cortical 
connections, this integrative function might be transferred to the prefrontal cortex, 
which would then become capable of integrating information embedded within other 
distributed cortical networks (Fig. 7). The prefrontal cortex may play another impor- 
tant executive role, that is, the modulation of hippocampal activation during remote 
memory recall. Closer examination of our imaging data revealed that neuronal activity 
in the hippocampus at the time of remote memory recall was actually significantly 
lower than that of control subjects exposed to the maze environment (Fig. 5B; Fran- 
kland et al. 2004; Maviel et al. 2004). This observation has two implications. First, it 
indicates that the observed reduction in hippocampal activation over time is not solely 
the consequence of a reduction in size of the neuronal population representing the fa- 
miliar (consolidated) memory or of a more widely distributed hippocampal networks 
supporting memory storage. Second, it raises the possibility that inhibitory influences 
arising from other brain regions, presumably cortical, that are actively engaged in 
memory storage and retrieval may progressively control the level of hippocampal par- 
ticipation during the course of memory consolidation. The prefrontal lobe has been 
shown to play a critical role in executive functions via a top-down control over pos- 
terior association cortical regions involved in sensory processing and voluntary recall 
(Tomita et al. 1999). We sought to examine whether similar executive control could be 
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exerted over the hippocampus at the time of remote memory retrieval. In preliminary 
experiments involving the five-arm maze, inactivation of the prefrontal cortex at the 
30-day retention interval resulted in a marked reactivation of the hippocampus. This 
effect was specific to the prefrontal cortex as it did not occur following inactivation 
of the other targeted cortical areas such as the anterior cingulate cortex. Functionally, 
the prefrontal cortex might control hippocampal activity via direct anatomical con- 
nections with the hippocampus or indirect modulation of posterior cortical areas from 
which originate entorhinal-hippocampal projections. A major role for this top-down 
inhibitory control would be to prevent encoding of existing (already corticalized), and 
therefore redundant, information (Fig. 7). In this scenario, whether or not inhibition 
occurs would be a function of the status of pre-existing knowledge already stored in the 
cortex. If retrieval is successful, hippocampal activation will rapidly be inhibited. How- 
ever, in the case of a mismatch situation, for example, if the actual situation requires 
encoding of novel information or if the information that is to be retrieved has been 
partially or totally forgotten, this inhibition will not be exerted and the hippocampus 
will be reengaged to enable updating and reconsolidation (Dudai 2004, 2006). This 
executive, top-down modulation of hippocampal activity is compatible with the ex- 
istence of comparator cells in the prefrontal cortex whose function is to evaluate the 
occurrence of match versus mismatch situations. 
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Fig. 7. A privileged role for the prefrontal cortex (PFC) in system consolidation. At the time 
of encoding, the hippocampus is crucial in integrating information from distributed cortical 
modules, each representing individual components of a memory. As memories mature over time, 
cortical-cortical connections are strengthened, allowing cortical regions, and particularly the 
PFC, to ensure retrieval of remote memories independently and to assume the integrative role 
initially exerted by the hippocampus. Consistent with this model, inactivation of the PFC blocks 
expression of remote memory but spares recent memory in rodent tests of contextual and spatial 
memory. The PFC might also exert executive control upon hippocampal activity during remote 
memory recall. The hippocampus is normally active when processing the outside environment. 
However, when the incoming information matches previously stored cortical information, the 
PFC will inhibit hippocampal activity via direct or indirect efferent pathways to prevent encoding 
of redundant information. In the presence of a mismatch situation, no such inhibition is exerted 
and the hippocampus will be engaged as usual. (Adapted from Frankland and Bontempi 2005) 
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Conclusion 


The definition of the term “consolidation” that is found in any dictionary is sim- 
ple: to strengthen or to make firm. Achieving consensus about its neurophysiological 
correlates, however, has proven to be much more challenging, and despite its initial 
conceptualization more than a century ago, the concept and the neuronal mecha- 
nisms underlying the consolidation process continue to be hotly debated in the field of 
cognitive neuroscience. 

One must acknowledge that, over the years, considerable progress has been made 
in unraveling the neurobiological bases of memory consolidation. Functional brain 
imaging techniques in particular, along with genetic approaches in mice and tradi- 
tional pharmacological and lesion approaches, have provided new breakthroughs in 
the identification of the different brain regions and neuronal networks supporting 
recent and remote memory storage. The consolidation process is no longer seen as 
a passive phenomenon but as a dynamic process of neuronal reorganization that af- 
fects extensive hippocampal and cortical networks over time scales that are much 
longer than those initially envisioned. While in accordance with the standard model 
of memory consolidation, our own findings highlight the fact that the cortex does not 
simply serve as a passive storage site but is also capable of actively integrating new 
memories, depending on the organization and status of pre-existing knowledge. The 
prefrontal cortex in particular appears to play a crucial role in integrating informa- 
tion from distributed cortical networks and modulating hippocampal activity during 
memory recall. It is our view that the maturation and consolidation of cortical memory 
traces should not be considered as a one time process but rather seen as an “open- 
ended” process involving continuous indexation and reorganization of pre-existing 
cortically stored information. 

However, despite considerable advances in the field of functional genomics and 
proteomics, we are still far from understanding all the precise cellular and molecular 
mechanisms underlying memory reorganization within cortical networks during the 
course of system consolidation. Numerous questions remain to be addressed. For in- 
stance, given the ongoing debate in the human memory field (Moscovitch et al. 2006), 
one might wonder whether the same pattern of neuronal reorganization over time 
applies equally to semantic and episodic declarative memories and with similar rates 
in each case. Does the eventual corticalization of memory traces entail a change in 
the initial format so that there would be an inevitable loss of details and richness? 
Experiments in intact animals using brain imaging approaches should contribute to 
resolving these debates but will require adequate tasks that effectively dissociate se- 
mantic from episodic components. Another question concerns the different kinetics 
of memory processing in the hippocampus compared to the cortex. Storage in the 
cortex happens slowly, presumably to prevent catastrophic interference and erasure of 
existing memories. In contrast, plasticity in the hippocampus can be modified at a fast 
pace. It is conceivable that different plasticity mechanisms coexist within these brain 
regions, but their cellular and molecular correlates remain to be elucidated. Interest- 
ingly, upon reactivation, cortical memories have been shown to become labile again 
and to require another round of protein synthesis to regain stability. While this process, 
termed reconsolidation, highlights the possibility for updating existing stored cortical 
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memories, the similarity and the potential differences in the mechanisms governing 
consolidation and reconsolidation are yet to be identified (Dudai 2006). 

Answers to these questions will require integrative approaches that involve cross- 
species analyses and combine several levels of analyses ranging from innovative cog- 
nitive paradigms to cellular, molecular, imaging and electrophysiological techniques, 
enabling us to probe not only the functional involvement of single neurons but also the 
functioning of complex neuronal ensembles. 
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From Molecule to Memory System: 
Genetic Analyses in Drosophila 
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Summary 


Despite the fact that the Drosophila brain has only 100000 cells, it produces complex 
behaviors and sustains various forms of learning and memory. We show here how the 
power of Drosophila molecular genetics permits the interconnecting of the different 
levels of memory analyses: molecular, network and system. In particular new, tools 
allow a precise spatial and temporal control of network activity, as well as recording of 
brain activity. Recent results indicate that the Drosophila brain is the site of complex 
phenomena more frequently associated with mammalian species. 


Introduction 


Drosophila melanogaster is one of the most intensively studied organisms in biology, 
and it serves as a model system for the investigation of many cellular, developmental 
and behavioral processes common to other species, including humans. This usefulness 
holds in particular for brain studies, as the Drosophila central nervous system (Fig. 1) 
is made of neurons and glia that operate on the same fundamental principles as 
their mammalian counterparts. Thus, most neurotransmitters are identical in flies and 
humans, and despite the fact that the Drosophila brain has only 100 000 cells (Shimada 
et al. 2005), it produces complex behaviors and sustains various forms of learning 
and memory. Beside studies of fundamental brain properties, fly models are being 
developed for a variety of neurodegenerative disorders, and the field is beginning to 
harness the power of Drosophila genetics to dissect pathways of disease pathogenesis 
and identify potential targets for therapeutic intervention. This approach is made 
possible by the fact that about 50% of human genes have a Drosophila orthologue 
(Rubin et al. 2000). In addition, transgenic strategies that allow the introduction of 
human genes into Drosophila continue to expand the list of modeled diseases, which 
now includes Parkinson’s disease, Alzheimer’s disease, Huntington’s disease and several 
spinocerebellar ataxias (for reviews, see Shulman et al. 2003; Bier 2005). 


Learning and memory genes 


In Drosophila, a single associative-learning trial (called the “short protocol”), consist- 
ing of an odor (the conditioned stimulus, CS) accompanied by electric shocks (the 
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unconditioned stimulus, US) induces olfactory learning and memory (Quinn et al. 
1974). The discovery of most of the learning and memory mutants was performed with 
this protocol. Multiple, spaced repetitions of this single protocol (called the “long pro- 
tocol”) generate long-term memory (LTM; Tully et al. 1994). We do not intend here to 
review in detail all Drosophila learning and memory genes or the various conditioning 
protocols (for reviews, see Waddell and Quinn 2001; Dubnau et al. 2003a; Heisenberg 
2003; Davis 2004). Rather, we would like to show how recent genetic technologies greatly 
facilitate the identification and study of learning and memory genes. 


Chemical mutagenesis versus transposable element mutagenesis 


Initially, Drosophila memory mutants were issued from random chemical mutagenesis 
using ethyl methane sulfonate (EMS), a potent mutagen. But because EMS induces 
mostly subtle DNA alterations, the identification of the mutated gene is often difficult. 
The first step consists in genetically mapping the mutation by a complementation test, 
using Drosophila’s deletion collection. However, this map only defines a broad area 
covering many other genes. In contrast, the use of a marked, transposable P-element 
allows one to easily identify the gene of interest. 

In the first behavioral screen, about 4000 fly stocks carrying EMS-induced muta- 
tions were tested for their ability to learn in an olfactory conditioning assay (Quinn et al., 
1974, 1979). The first Drosophila learning and/or memory mutants were dunce (dnc) 
and rutabaga (rut). Biochemical defects were observed in rut (reduced cAMP level) 
and dnc (increased cAMP level; Livingstone et al. 1984). Nevertheless, the cloning of the 
responsible gene took years and was facilitated by the use of new P-induced alleles. For 
example, the description of P-element insertions within 200 nucleotides of where the 
rut transcription started identified rut as the structural gene for the Ca**/Calmodulin- 
responsive adenylate cyclase (Han et al. 1992; Levin et al. 1992). New EMS-induced 
dnc mutants came from a screen for female sterility mutants (Mohler 1977). The dnc 
gene was identified by recombinational mapping of dnc mutations with restriction 
site polymorphisms as genetic markers (Davis and Davidson 1984). The first amnesiac 
(amn) mutant was identified in a screen for flies with affected memory (Quinn et al. 
1979). However, the amn gene was cloned as a second site suppressor of the dnc female 
sterility phenotype from a P-element-induced allele and has been repeatedly isolated 
since (Feany and Quinn 1995; Moore et al. 1998; Toba et al. 1999). The amn encodes 
a putative adenylate cyclase activating peptide. 

Another learning and/or memory mutant is radish (rsh; Folkers et al. 1993). The rsh 
gene was localized within a 180-kb interval in the 11D-E region of the X-chromosome, 
and several candidate genes were identified (Folkers et al. 1993). Recently, Chiang 
and colleagues reported (2004) that the responsible gene for the rsh phenotype was 
a phospholipase A2 (Chiang et al. 2004). However, this original finding failed to be 
reproduced by Dubnav and colleagues, and a second team has reported that rsh encodes 
a novel protein with possible nuclear localization motifs (Folkers et al. 2006). This 
discrepancy illustrates the difficulty linked to working with EMS-induced behavioral 
mutants. 

P-element-based behavioral screens for learning and memory mutants have been 
performed. Various mutants were issued from these mutageneses, including nalyot 
(a myb-related Adf1 transcription factor; Boynton and Tully 1992; DeZazzo et al. 2000), 
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leonardo (a zeta isoform of the 14-3-3 protein; Skoulakis and Davis 1996), and volado 
(two splice variants of an a-integrin; Grotewiel et al. 1998). 

Dubnau and colleagues (2003b) recently performed a behavioral screen for LTM 
mutants, in parallel with microarray experiments, aimed at selecting genes with altered 
expression after LTM training. This work led to the identification of proteins involved 
in mRNA processing and translation (Dubnau et al. 2003b). 

We recently described crammer (cer), a gene involved specifically in the set up of 
LTM (Comas et al. 2004). The P[GAL4] cer strain has a reduced LTM but a normal short- 
term and middle-term memory. Interestingly, in the wild-type strain, cer is transiently 
underexpressed three hours after LTM training. As the Cer peptide is an inhibitor of 
cysteine proteinases, the decrease in its expression shortly after intensive training must 
lead to a transient activation of its cysteine proteinase(s) target(s) (Comas et al. 2004). 
All together, these works demonstrate the power of the forward-genetic approach: 
learning or memory mutants are first characterized without knowing in advance the 
molecular function of the gene involved. 


Spatial control of gene expression 


A powerful tool of spatial control of gene expression is the GAL4/UAS enhancer-trap 
technique (Bellen et al. 1989; Wilson et al. 1989; Brand and Perrimon 1993; Fig. 2), 
which enables a selective activation of any cloned gene in a wide variety of tissues and 
cells (Fig. 2). An enhancer-trap element is a transposon containing an exogenous gene, 
such as the yeast transcriptional activator GAL4. Insertions that occur in close prox- 
imity to a transcriptional enhancer cause the GAL4 gene to be expressed in a pattern 
reflecting the enhancer’s spatio-temporal regulatory properties. Thousands of lines 
carrying an insertion of the P[GAL4] element are available. With a single cross, flies 
are created that carry a second P-element with a reporter gene inserted downstream 
of GAL4 binding sites: the upstream activating sequences (UAS). The reporter gene is 
expressed in the same cells as GAL4. This flexible system allows for, for example, the 
labeling of a particular group of cells if the reporter gene encodes the green fluorescent 
protein (GFP; Fig. 2). Yang and colleagues (1995) characterized the intrinsic cells of 
the Drosophila olfactory memory center, the mushroom bodies (MBs; Fig. 1; Yang et al. 
1995). Rather than being homogeneous, the 5000 MB neurons were compound neu- 
ropils in which parallel subcomponents exhibited discrete patterns of gene expression 
(Crittenden et al. 1998; Strausfeld et al. 2003). Parallel channels of information flow, 
perhaps with different computational properties, subserve different roles (see below). 

Using the enhancer-trap system, Ito and colleagues (1998) searched for MB extrinsic 
neurons. They used a reporter construct encoding the presynaptic protein, neuronal 
synaptobrevin-green fluorescent protein, and showed that output MB neurons are 
scarce and that, surprisingly, few MB extrinsic neurons project to the deutocerebrum, 
the premotor pathway that is the immediate modifier of behavior. 


Temporal and spatial control of gene expression 


When a gene is mutated, the biochemical function of the protein may be affected, or the 
transcription level of the normal mRNA may be decreased. The resulting biochemical 
defect may occur during development and/or adult life. Thus, a learning and/or memory 
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Fig. 1. Drosophila olfactory system. Olfactory sensory neurons project to the antennal lobes 
(AL). From there, projection neurons project through the antenno-glomerular tract (AGT) and 
connect MB dendrites localized in the calyx (Ca), as well as the lateral horn (LH). Each MB is 
composed of about 2,500 neurons, the Kenyon cells (KC). Three type of KC project in five lobes: 
a/B, a'/B’ and y 


defect may be due to structural brain defects rather than to a specific physiological 
alteration. This issue is crucial, as even a faint developmental defect may have major 
consequences on brain function. 

Two recent techniques, developed by the laboratory of Ronald Davis, permit the 
discrimination of a role in brain development from a role in memory formation per 
se. TARGET and Gene-Switch (Leung and Waddell 2004) are variants of the invaluable 
GAL4/UAS expression system. TARGET, which stands for Temporal and Regional Gene 
Expression Targeting, uses a modified temperature-sensitive yeast, GAL80 transcrip- 
tion factor (GAL80*) to repress GAL4 activity (Fig. 3). At low temperature (typically 
18 °C), the functional GAL80* protein inhibits GAL4 transcriptional activity. At 30 °C, 
GAL80* becomes inactive and allows the GAL4-mediated transcription (Fig. 3). This 
tool can be used to rescue the defect of a memory mutant by expressing the normal 
protein in specific regions of the adult brain while preventing its expression during 
development. Conversely, if the UAS downstream gene expresses a double-stranded 
interferent RNA (RNAi), one can lower the expression of a given gene in specific adult 
circuits (Fig. 3). Thus, the TARGET technique allows us to increase or decrease the 
expression of a precise gene in a temporal and regional manner. 
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Fig. 2. The GAL4/UAS system. When an enhancer-trap transposable element is inserted near 
transcription enhancers that control expression in a given structure, the GAL4 gene that is 
contained in the P-element is expressed in the same structure. If this fly also contains a reporter 
gene downstream of the UAS sequences, GAL4 will transcribe the reporter. MB, mushroom body 
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Fig. 3. The TARGET technique helps to discriminate a developmental gene action from an adult 
physiological effect. A sequence encoding an inverted repeat RNA (RNAi) is placed under UAS 
control. (A) GAL4 activates RNAi expression during development and adulthood. A memory 
defect could be caused indirectly by structural defects occurring during development. (B) The 
TARGET system solves this problem. Flies are maintained at low temperature during their devel- 
opmental stages. At this temperature, GAL80“ inhibits the transcription activity of GAL4, so no 
RNAi is expressed. When adults emerge, they are transferred at permissive temperature (30 °C); 
the GAL80* protein becomes ineffective and the GAL4 activates the transcription of the RNAi 
in adult MBs. The putative memory defects are due to the decreased concentration of the mRNA 
targeted by the RNAi 


46 G. Isabel et al. 


Gene-Switch uses a different mechanism to temporally control GAL4 activity 
(Burcin et al. 1999). The GAL4 DNA-binding domain was fused to a mutated proges- 
terone receptor ligand-binding domain and part of the NF-B p65 activation domain. 
This synthetic transcription factor is active only when the antiprogestin, mifeprestone 
(RU486), is present. When flies are fed RU486, Gene-Switch is on and the UAS-transgene 
is expressed. As with the GAL4/UAS system, region-specific Gene-Switch activity is ac- 
complished by the use of specific enhancer (Osterwalder et al. 2001; Roman et al. 2001). 
The use of these two techniques for olfactory memory studies will be detailed in the 
next section. 

Finally, we recently developed a new GAL4 tool using an inducible chimerical 
protein that carries the Suppressor-of-hairy-wing repressor domain fused to the GAL4 
DNA-binding domain. This tool also allows a temporally controlled repression of genes 
located near a P[UAS] insertion (Pascual et al. 2005). 


Studying networks implicated in memory 


Olfactory short-term memory 


The MBs, centers of olfactory short-lasting memory 

The insect brain contains a pair of prominent and characteristically shaped neural 
centers, the MBs. In Drosophila, MBs are composed of three main classes of neurons 
whose axons divide to form two vertical lobes (a and a’) and three median ones (p, 
Band y) (Crittenden et al. 1998; Fig. 1). MBs are a specialized neuropile involved in 
processing and storing multimodal sensory information (Li and Strausfeld 1997). In 
the 1970s and ‘80s, the function of the MBs was assessed in different insect species 
with the classical interventionism approaches, cooling or ablation (Menzel et al. 1974; 
Erber et al. 1980). Over the years, MBs have been implicated in olfactory learning and 
memory and in a variety of complex functions, including courtship, motor control and 
spatial recognition. 

Brain mutants with MB structural defects were isolated in the 1980s. The use 
of unlimited numbers of mutant animals with the same anatomical defect without 
interventionism helped to highlight the role of MBs in olfactory learning and memory 
(Heisenberg et al. 1985). However, one caveat of this approach is that the anatomical 
defects are not specific to the MBs (de Belle and Heisenberg 1994). An alternative 
approach was used to generate flies without MBs (de Belle and Heisenberg 1994). 
Hydroxyurea, an antimitotic agent, was fed to newly hatched wild-type larvae. At that 
early developmental stage, only five neuroblasts - the neurons’ precursor cells - are 
mitotically active within each brain hemisphere: the four that generate the MBs and one 
in the antennal lobe. Thus, hydroxyurea treatment led to viable adult flies with almost 
no MBs. These flies turned out to show no olfactory memory (de Belle and Heisenberg 
1994). The essential role of MBs in olfactory memory is further outlined by the high 
expression in the MBs of many of the proteins involved in learning and memory, such 
as Dnc (Nighorn et al. 1991), Rut (Han et al. 1992) and DCO (Skoulakis et al. 1993). 
Even though Amn (Quinn et al. 1979) is not expressed in the MBs, it is expressed by the 
dorsal paired median neurons (DPM) that project onto the MBs (Waddell et al. 2000). 
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Thus, Drosophila MBs are required for olfactory learning and memory, and insights 
gained from the first mutants have indicated that the cAMP pathway plays a key role for 
memory establishment. Nevertheless, it remained to be proven that cAMP metabolism 
was strictly required in MBs for correct learning and memory. Thanks to the GAL4/UAS 
system (Brand and Perrimon 1993), Connolly and colleagues (1996) disrupted MB 
cAMP signaling by expressing a constitutively activated G-protein (Ga). Permanent 
adenylyl cyclase activation led to an impaired associative memory. 

That memory is impaired after ablation or functional disruption ofa brain structure 
does not necessarily imply that the memory trace itselfis localized within this structure. 
To localize short-term memory (STM), it was necessary to rescue the memory abilities 
of a mutant by expressing the corresponding protein in a specific brain structure. 
In a first step, Zars and colleagues (2000) expressed Rut Ca”*/Calmodulin adenylyl 
cyclase in MBs to restore a normal learning capacity. However, in this experiment, Rut 
was expressed in the MBs at the adult stage but also during development. Thus, the 
behavioral rescue might have been due to the correction of a MB developmental defect, 
the indirect cause of the learning defect. Thanks to the TARGET and SWITCH methods 
(see above), McGuire and colleagues (2003) showed that the presence of Rut in adult 
MBs alone was sufficient to rescue rut memory defect. The current view is that Rut 
adenylyl cyclase could act during learning as a coincidence detector for the US and CS 
(McGuire et al., 2003). 

Thanks to the GAL4/UAS system, Waddell et al. (2000) demonstrated a role for 
the Amn peptide in a well-defined group of cerebral neurons involved in middle-term 
memory formation. In the amn memory mutant, the amn* gene may be expressed 
in only two neurons (DPM) to re-establish normal olfactory memory (Waddell et al. 
2000). DPM neurons project onto the MBs. 


A tool to build anatomo-functional maps 

A sophisticated approach to disturb neuronal circuits, based on a rapid and reversible 
blockage of synaptic transmission, was developed by Kitamoto (2001). The shibire (shi) 
gene encodes a microtubule-associated GTPase, Dynamin, which is involved in endo- 
cytosis and is essential for synaptic vesicle recycling and maintenance of the readily 
releasable pool of synaptic vesicles (Chen et al. 1991). The temperature-sensitive allele, 
shi's', is defective in vesicle recycling at restrictive temperatures (> 29 °C), resulting 
in a rapid blockage of synaptic transmission. The shi‘! mutation has a dominant ef- 
fect because it blocks chemical synapses even in the presence of a normal shi‘ allele. 
The expression of shi"! tool can therefore be used within the GAL4/UAS system. The 
GAL4/UAS-shi‘*! approach is very powerful, as it allows the inhibition of particular 
brain circuits at a precise time. Are the MBs required during the acquisition, consol- 
idation or retrieval phase? It was possible to address this question by expressing the 
temperature-sensitive Shi“! protein specifically in MB neurons to transiently disrupt 
synaptic neurotransmission (Dubnau et al. 2001; McGuire et al. 2001). It was shown 
that the synaptic outputs of MBs neurons were required during retrieval of the STM 
but not during acquisition or consolidation. All together, these data indicate that a STM 
trace can be localized in MBs. Circuits that connect to the MBs have also been studied 
with Shi“!. Thus, it has been established that abolishing vesicle-mediated secretion 
from the DPM in a precise time window, during the consolidation phase (Keene et al. 


48 G. Isabel et al. 


2004), phenocopies amn mutant memory defect (Waddell et al. 2000); likewise, blocking 
dopaminergic neurons (which project onto MB lobes) only during acquisition disrupts 
memory (Schwaerzel et al. 2003). 


Imaging brain activity 

One major caveat of drosophila central brain studies is that direct electrophysiological 
analysis is scarce (Wilson et al., 2004), due to the small size of neuron cell bodies 
(less than 5m in diameter). To circumvent this difficulty, two partially alternative 
approaches have been followed: analysis of learning memory mutants at the neuro- 
muscular junction (Zhong and Wu 1991; Renger et al. 2000), which leaves out the 
complexity of brain physiology, and analysis of isolated MB neurons in culture (Wright 
and Zhong 1995). Those experimental systems can provide interesting molecular and 
cellular information, but they are inadequate for assessing neuronal function at the 
level necessary for a global understanding of memory systems. 

Odor processing occurs in a complex tissue environment, and identification of the 
repertoire of brain cell assemblies involved in olfactory memory requires visualization 
of the network activity at high spatial and temporal resolution, in preparations that 
are as intact as possible. Optical neural activity recordings allow study of active brains 
with micrometer-spatial resolution, and activity-sensitive fluorescent probes have been 
recently used in Drosophila. Interestingly, those sensors are proteins and therefore their 
expression can be driven to specific subsets of neurons with the GAL4/UAS system. 
Several sensors have been successfully brought to Drosophila; they monitor the local 
change of pH that accompanies neurotransmitter release (Yu et al. 2004) or changes in 
the intracellular calcium concentration that provide a valuable indicator of electrical 
activity [Aequorin (Rosay et al. 2001): Cameleon (Fiala et al. 2002), Camgaroo (Yu 
et al. 2003), and G-CaMP (Wang et al. 2003, 2004)]. Using the G-CaMP reporter and 
two-photon microscopy, stereotyped odor-evoked patterns have been observed in the 
antennal lobe glomeruli (Wang et al. 2003) and in the MBs (Wang et al. 2004). 

The ultimate goal of imaging studies is to build a functional map of cell assemblies 
encoding memory in different regions of Drosophila brain by comparing the activity 
of trained and naive animals, in normal flies or memory mutants. A first step was 
achieved when a transient change in the spatial code was observed in the antennal 
lobe of wild-type flies three min after olfactory associative conditioning (Yu et al. 
2004). Another study has shown that, in naive flies, electric shock generates a strong 
activity in dopaminergic neurons whereas the odor generates a weak signal. How- 
ever, after several pairings between the odor and the shock, odor-evoked activity is 
significantly prolonged. In agreement with the behavioral approach (Schwaerzel et al. 
2003), in vivo imaging therefore suggests that dopaminergic neurons play a role in 
aversive reinforcement in Drosophila (Riemensperger et al. 2005). In the same way, Yu 
et al. (2004) studied the function of the Amn-expressing DPM neurons in the memory 
process. DPM neurons responded to both the shock (the US) and to the odor (the 
CS), and pairing the CS and the US increased odor-evoked calcium signals in the 
same time window during which DPM neuron synaptic transmission was required 
for normal memory (Yu et al. 2004). The behavioral experiments using thermosensi- 
tive shibire (Waddell et al. 2000; Keene et al. 2004) were strengthened once again by 
imaging. 
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Olfactory LTM 


As seen above, in addition to STM, Drosophila can display LTM after a spaced and 
repeated conditioning (long protocol; Tully et al. 1994). LTM can last several days and 
depends on de novo protein synthesis. Are MBs implicated in Drosophila LTM? To 
answer this question we analyzed the alpha-lobes absent (ala) mutant, which shows 
a peculiar MB phenotype. Ten percent of ala individuals possess all five MB lobes, 
36% lack the horizontal B and p’ lobes, and 4.5% lack the vertical a and œ lobes (the 
remaining sub-populations presented different MB phenotypes in the left and the right 
hemisphere; Pascual and Preat 2001). ala mutant flies were trained with the short or 
the long protocol, and we analyzed separately the brain of flies that had made the 
correct and the wrong choice during the memory test to calculate the memory score 
of each class of ala mutants. Flies lacking a/a’ lobes displayed no LTM, although their 
short-lasting memories were wild-type. Flies with all lobes present or without B/p’ 
lobes had a normal STM and LTM. Thus, MBs are necessary to perform LTM and, more 
particularly, the a/a’ vertical lobes (Pascual and Preat 2001). It was further shown that, 
by expressing Shi“! in a/f lobes, a lobes outputs are required during LTM retrieval 
(Isabel et al., 2004). 

If MBs play a pivotal role in Drosophila olfactory learning and memory, a recent 
study suggests that brain structures located outside the MBs also participate in olfactory 
memory (Pascual et al. 2004). We have recently shown that the Drosophila brain is 
asymmetric; a small structure expressing Fasciclin II, the asymmetric body, is present 
only in the right hemisphere. Interestingly, about 7% of Canton-Special wild-type 
flies present a bilateral structure. Those symmetric flies showed no LTM at four days, 
suggesting that asymmetry may be required for generating, maintaining or retrieving 
LTM (Pascual et al. 2004). The asymmetric body is located near the central complex, 
a structure that connects brain hemispheres. However, the functional links between 
the asymmetric body and other brain circuits remain to be sorted out. 

In an elegant study, Ashraf et al. (2006) showed that protein synthesis at the 
synapse is required for LTM and that LTM formation depends on calcium/calmodulin- 
dependent Kinase II (CaMKII) signalling, a pathway also implicated in synaptic plas- 
ticity in mammals (Kelleher et al. 2004). By driving the expression of a tagged CAMKII 
in projection neurons that link olfactory sensory neurons to MBs, they proved that 
the recruitment of the CAMKII expression to post-synaptic sites in the antennal lobe 
glomeruli (Fig. 1) is required to induce LTM (Ashraf et al. 2006). By performing this ex- 
pression in different mutant backgrounds, they showed that synaptic protein synthesis 
was regulated by the RNA interference silencing complex (RISC). 


Operant visual learning and memory 


In Drosophila, MBs seem especially involved in olfactory learning and memory and 
not in visual learning (Wolf et al. 1998). Flies can remember that a visual pattern 
represents a danger because this cue has been associated with heat (for review, see 
Heisenberg et al. 2001). Recently, using a variety of neurogenetic tools, Liu et al. (2006) 
identified a central brain structure, the fan-shaped body (FB), as being involved in 
visual memory (Fig. 4). After showing that the adenylate-cyclase Rut was required for 
the association between the visual pattern and the reinforcer, they restored a normal 
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Fig.4. Schematic representation of mushroom bodies (MB) and Central Complex (CX) neuropile. 
Dorsal is up; the most anterior is the first plane. MB: y lobes, striped; a and f lobes; œ and p’ 
lobes; ped, peduncle; Ca, calyx; CX: EB, ellipsoid body; FB, fan-shaped body; NO, nodulli; PB, 
protocerebral bridge 


visual memory capacity in the rut mutant by specifically expressing the rut gene in the 
upper stratum of the FB (called the F5 neurons). Interestingly, F5 neurons are required 
to learn about the horizontal elevation of a cue, whereas the ability to learn about the 
contour orientation of the pattern involves FB F1 neurons (a lower horizontal stratum 
in FB). These results suggest that memories of two different visual features - elevation 
in the panorama and contour orientation - are stored in different groups of neurons 
within the same structure (Liu et al. 2006). 


The dynamic of olfactory memory phases in drosophila 


The short protocol induces two labile phases: STM, which is disrupted in mutants 
affected for cAMP metabolism and lasts about 30 min; and middle-term memory 
(MTM), which is disrupted in amn and lasts for a few hours. STM and MTM are 
anesthesia sensitive, as they are erased if flies are cooled down to 4 °C after conditioning. 
This property suggests that STM and MTM are sustained by electrical brain activities. 

Drosophila also display two long-lasting memory phases, anesthesia-resistant 
memory (ARM; Quinn and Dudai 1976) and LTM (Tully et al. 1994; Isabel et al. 
2004). ARM is generated by the short protocol or several massed trials. ARM can 
still be detected after a few days (Tully et al. 1994). This memory is disrupted in rsh 
mutants (Folkers et al. 2006). The atypical protein kinase M (aPKM) is a persistently 
active truncated isoform of atypical protein kinase C (aPKC). Overexpression of aPKM 
enhances memory after massed conditioning but not after spaced training (Drier et 
al. 2002). Moreover, inhibition of aPKM disrupts consolidated memory after massed 
conditioning (Drier et al. 2002). It is therefore conceivable that aPKC is a molecular 
support of ARM. 

LTM is induced by the long protocol and measured for at least one week (Tully et al. 
1994). Like ARM, LTM is anesthesia resistant (Isabel and Preat, unpublished results). 
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It is disrupted by a cAMP Response Binding Protein (dCREB2-b) repressor (Yin et al. 
1994; Perazzona et al. 2004), likely due to the inhibition of gene expression required to 
establish LTM. Yin and colleagues (1995a,b) reported that flies overexpressing a partic- 
ular CREB isoform (dCREB2-a) generated LTM after a single training cycle. However, 
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Fig. 5. Model of associative memory phases (A) and temporal dynamics of memory phases 
(B) generated by a single cycle of conditioning (short protocol). LRN: learning; STM: short-term 
memory; MTM: middle-term memory; ARM: anesthesia-resistant memory 
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this work could not be replicated, and it was shown that the original dCREB2-a trans- 
genic flies carried an accidental mutation that produced a truncated protein with no 
DNA binding domain (Perazzona et al. 2004). Moreover, adult induction of the correct 
CREB2-a isoform led to lethality (Perazzona et al. 2004). 

What is the dynamic of memory phases in Drosophila? We proposed recently 
a model that involves two parallel memory pathways, one with cAMP dependent 
STM/MTM and the other with ARM (Fig. 5A). Indeed, dnc and rut retain a significant 
level of early memory (Tully and Quinn 1985), suggesting that an adenyl cyclase-Rut 
independent learning might exist. Moreover, ARM levels in rut and amn are near 
to normal (Folkers et al. 1993; Isabel et al. 2004), whereas their labile memories are 
strongly affected. Thus ARM does not seem to depend on STM/MTM. Instead a second 
learning process could give rise to a STM2 phases and later to ARM (Fig. 5). 

What are the relationships between ARM and LTM? To answer that question, 
the ala mutant was trained with the long protocol and the memory of flies lacking 
vertical «/a’ lobes was measured at 30 min and 5 hours after the training. Thirty-min 
memory was normal, but, surprisingly, five-hour memory was close to zero. Memory 
performance was normal at five hours when flies without vertical lobes were trained 
with the short protocol (Isabel et al. 2004; Fig. 6). Why does a longer training give 
rise to a weaker memory? ala flies display no LTM because they lack the vertical 
lobes, the center for LTM. These flies show a normal ARM five hours after the short 
protocol but they show no ARM after the long protocol. This result suggests that ARM 
is erased after LTM conditioning. Thus the consolidated memory phases generated by 
olfactory conditioning are exclusive (Fig. 7; Isabel et al., 2004). Why is ARM erased 
after LTM conditioning? We propose that ARM acts as a gating mechanism for LTM 
formation, avoiding a heavy cascade of gene expression in the absence of intensive 
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Fig. 6. In flies without MB alpha lobes, which normally sustain long-term memory, the long 
protocol decreases memory performance at five hours in comparison with the short protocol. 
Gray line, short protocol; black line, long protocol 
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Fig. 7. Model of associative memory phases (A) and temporal dynamics of memory phases 
(B) generated by five cycles of conditioning (long protocol). LRN: learning; STM: short-term 
memory; MTM: middle-term memory; ARM: anesthesia-resistant memory; LTM: long-term 
memory 


spaced conditioning. Despite the relative simplicity of the Drosophila brain, this model 
suggests a cognitive complexity more frequently associated with mammalian models. 
It supports the idea that Drosophila is a valid model to study some of the molecular and 
cellular mechanisms involved in normal or pathological human memory (Shulman 
et al. 2003). 
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Towards a Molecular and Cellular Understanding 
of Remote Memory 
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Summary 


While the molecular, cellular and systems mechanisms required for the initial pro- 
cessing of memory have been intensively investigated by the new field of molecular 
and cellular cognition, those underlying permanent storage remain elusive. Here, we 
review neuroanatomical, pharmacological and genetic results demonstrating that spe- 
cific areas of the cortex play a critical role in the storage of remote memory. Imaging 
experiments in rodents show that specific areas of the cortex are activated by re- 
mote memory and that this activation is impaired by manipulations that block remote 
memory. Accordingly, reversible inactivation of specific cortical structures in rodents 
disrupts remote memory without affecting recent memory. These results open a new, 
exciting window into memory studies and suggest a number of interesting experi- 
ments that will address molecular, cellular, systems and cognitive aspects of this poorly 
understood phase of memory. 


Introduction 


Memory is known to have several distinct phases, including a later phase in which 
memories are filtered and integrated into previous semantic constructs. This process 
is thought to involve molecular, cellular and systems processes that work in concert 
to transform and integrate information in the brain (McGaugh 2000; Debiec et al. 
2002; Dudai 2004). In humans, research on the gradual transformation of memory 
representations over time has focused on declarative memories and their dependence 
on structures in the medial temporal lobe (MTL). These memories require the MTL 
and they (or some version of them) are thought to eventually be “stored” in neocortical 
circuits (Squire 1994; McClelland et al. 1995; Nadel and Moscovitch 1997). Results 
of studies of hippocampus-dependent memory in animals are largely consistent with 
this general idea (Zola-Morgan and Squire 1990; Kim and Fanselow 1992; Kim et al. 
1993; Anagnostaras et al. 1999; Clark et al. 2002); but see also Sutherland et al. 2001). 
However, these studies have revealed remarkably little about the sites and mechanisms 
of remote aspects of memory. Similarly, the introduction of transgenic techniques 
has fueled an expansion of molecular and cellular studies of the very early stages 
(microseconds, seconds) of information processing in hippocampal networks (Mayford 
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and Kandel 1999; Matynia et al. 2002; Tonegawa et al. 2003) but uncovered little about the 
mechanisms responsible for remote memory processing and integration in neocortical 
networks. A series of recent experiments, including studies from our own laboratory, 
demonstrate that specific regions of the neocortex and plastic mechanisms in these 
areas are critical for remote memory (Bontempi et al. 1999; Frankland et al. 2001, 
2004; Takehara et al. 2003; Cui et al. 2004; Hayashi et al. 2004; Maviel et al. 2004; 
Remondes and Schuman 2004). One of these studies used an elegant genetic strategy 
to demonstrate that NMDAR function is critical for remote memory (Cui et al. 2004), 
and our laboratory showed that «CaMKII, one of NMDAR’s key downstream effectors, 
is also involved in this memory phase (Frankland et al. 2001, 2004). Please note that 
a more elaborate description of many of the ideas and information included in this 
session was previously published in a review by the same authors (Wiltgen et al. 2004). 


The involvement of multiple brain systems in remote memory 


Determination of the specific systems involved in remote memory began with the find- 
ing that damage to the MTL produces severe amnesia. Patients with MTL damage have 
great difficulty forming new long-term memories (Scoville and Milner 1957; Penfield 
and Milner 1958; Corkin 1984). Additionally, MTL damage affects the acquisition of 
new declarative memories, while having a lesser effect on remote memories acquired 
long before the damage. These findings suggested that the MTL is essential for the 
initial acquisition and retrieval of declarative information but that eventually another 
structure, such as the neocortex, becomes involved in remote memory (Squire 1992). 
A related observation is that when brain pathology includes damage to the neocortex, 
remote memory is often impaired (Graham and Hodges 1997; Squire et al. 2001; Bayley 
et al. 2003). This observation suggests that neocortical areas are important for remote 
memory and that, although new memories are initially dependent on the MTL, they 
gradually become dependent on neocortical circuits (Alvarez and Squire 1994). 

But why does the brain need two complementary memory systems? It is possible 
that gradual interleaving of memories in the neocortex is essential for the discovery 
of generalities and the eventual formation of knowledge structures (McClelland et 
al. 1995). The rapid and direct incorporation of new information into an intricate 
knowledge system such as the neocortex could cause catastrophic interference (Mc- 
Clelland et al. 1995). Instead, the hippocampus is thought to serve as a fast connector 
between distributed cortical memories, thus allowing these memories to be slowly 
integrated into existing knowledge systems in neocortical networks (McClelland et 
al. 1995). Besides its role in temporarily linking distributed cortical memories, the 
hippocampus is also thought to have a critical role in reactivating them. Reactivation 
by the hippocampus may serve to gradually strengthen the weak connections between 
memory fragments distributed in different neocortical sites (Buzsaki 1989) and may 
drive integration with previous related memories. 


The neocortex and remote memory 


The first evidence for neocortical engagement during in remote memory came from ex- 
periments in which animals were trained on a hippocampus-dependent spatial learning 
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task and then monitored for brain activity [using (!4C)2-deoxyglucose uptake] follow- 
ing either recent (i.e., 5 days) or remote (> 25 days) memory tests (Bontempi et al. 
1999). Retrieval of recent spatial memories produced more robust hippocampus ac- 
tivation than retrieval of remote memories, a result consistent with the model that 
proposes progressive disengagement of the hippocampus as memories are processed 
and integrated into semantic neocortical structures (Bontempi et al. 1999). In con- 
trast, several neocortical areas studied, including the prefrontal cortex (PFC), showed 
the opposite pattern of activation, with more activation during remote memory tests. 
These results provided evidence in favor of the participation of cortical networks in 
remote memory processes. They revealed, for the first time, that specific neocortical 
areas do in fact become more engaged as memories become remote (Bontempi et al. 
1999). These studies also pointed to specific regions of the neocortex that become 
activated in later stages of memory processing. The data revealed that, as memories 
change from recent to remote, the medial PFC [including the anterior cingulate cortex 
(ACC)], frontal, and temporal cortex all became more engaged (Bontempi et al. 1999). 
This finding allowed researchers to examine remote memory processes by targeting 
specific areas of the neocortex. Studies with this strategy, including our own, found 
that regions of the PFC are critical for the later stages of memory (Takehara et al. 2003; 
Frankland et al. 2004; Maviel et al. 2004). For example, hippocampal lesions in rats 
affect recent but not remote memory for trace conditioning (Takehara et al. 2003). In 
contrast, lesions of the medial PFC (including the ACC) produced the reverse gradient; 
they had only a small effect early (one day) after training but were devastating when 
made at later time points (two and four weeks). These results parallel prior activation 
data (Bontempi et al. 1999) and provide convergent evidence that activation of PFC 
regions, including the ACC, is critical for remote memory processes. 

Our laboratory (Frankland et al. 2004) found increases in neocortical immediate 
early gene (IEG) expression following the retrieval of remote context fear memories 
in mice. IEGs can be used as markers of neuronal activation, and some of these 
genes, such as Zif268, ARC, etc., are also required for long-term potentiation and 
memory. We found that retrieval of recent memories produced robust IEG expression 
in the hippocampus but not in specific cortical areas such as the PFC and temporal 
cortex. However, retrieval of remote context memories produced the opposite effect, 
i.e.; educed IEG expression in the hippocampus and increased expression in temporal 
cortex and PFC areas, including ACC. The functional importance of this activation was 
demonstrated by blocking activity in the ACC during recent and remote memory tests: 
inactivation of the ACC impaired retrieval of remote (18 and 36 days post-training) 
but not recent (one and three days post-training) context fear memories. This finding 
suggests that, while the hippocampus is engaged, context fear memories do not require 
the PFC, but with the progressive disengagement of the hippocampus, the PFC becomes 
essential for late-stage memory processes. Similarly, recent findings showed increased 
IEG expression in neocortical regions (including ACC) following the retrieval of remote 
but not recent spatial memories in mice (Maviel et al. 2004). Targeted inactivation of 
these regions impaired the retrieval of remote but not recent spatial memory. Consistent 
with similar findings by our laboratory (Frankland et al. 2004), this study also revealed 
evidence that suggests remote memory triggers synaptic structural changes in the 
neocortex. Animals tested 30 days after training showed increases in the expression of 
the growth associated protein GAP43 compared to a 1-day retention group. This finding 
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suggests that processing/integrating remote memories in cortical networks may involve 
synaptic structural changes. Another study suggesting that remote memory processes 
require synaptic remodeling involved mice with a dominant-negative transgenic PAK, 
a regulator of actin remodeling. The dominant-negative PAK transgene produced 
changes in plasticity and spine morphology in the cortex but not in the hippocampus. 
Accordingly, the mutant mice exhibited remote memory impairments (Hayashi et al. 
2004). They exhibited normal recent but abnormal remote spatial memories. These 
results are consistent with findings indicating that experience-dependent changes in 
cortical function, such as those driven by sensory deprivation, are accompanied by 
synaptic structural changes (Trachtenberg et al. 2002). 

In summary, specific areas of the neocortex are more activated by remote than by 
recent memory tests, and damage or inactivation of some of these activated regions 
selectively impairs remote but not recent memory. These studies also suggest that 
synaptic remodeling is an important facet of remote memory, suggesting that a dynamic 
reorganization of neocortical circuitry is required for integrating new information into 
pre-established semantic structures. If synaptic restructuring is occurring in specific 
neocortical areas during consolidation, one may ask what are the molecular and cellular 
mechanisms mediating these changes? 


Molecular and cellular mechanisms of remote memory 


Previous work showed that «CaMKII is essential for LTP and hippocampus-dependent 
learning (Matynia et al. 2002). Recently, our laboratory also found that this kinase plays 
a critical role in later stages of memory (Frankland et al. 2001). Spatial and contextual 
memory studies showed that a «CaMKII heterozygous null mutation (a#CaMKII+/—) 
disrupted remote memory more severely than recent memory. Remarkably, electro- 
physiological analysis of these mice found normal hippocampal but impaired neocor- 
tical LTP (likely the result of robust «CaMKII expression in the hippocampus relative 
to the cortex). It is possible that the loss of cortical LTP interfered with some aspects of 
cortical remote memory and produced the unusual amnesic phenotype seen in these 
mice. The implication is that «CaMKII is a critical factor for the cortical plasticity un- 
derlying remote memory. Just as plasticity is thought to be required for the early stages 
of memory processing in hippocampal circuits, it may also be required for the pro- 
gressive integration of this information into semantic cortical circuits. In a subsequent 
study (Frankland et al. 2001), our laboratory showed that the increases in neocortical 
activation, as measured by IEG induction, observed in normal mice following remote 
memory retrieval were completely absent in the CaMKII mutants, as if plasticity or 
aCaMKII were a key requirement for the series of events that engage the cortex during 
remote memory processes. Hippocampal activation following recent memory retrieval 
and cortical activation immediately after training were normal in the mutants, whereas 
neocortical activation after remote memory was absent. This finding again suggests 
that remote memory depends critically on «CaMKII. Strikingly, elegant transgenic 
experiments demonstrated that a key activator of aCaMKII (NMDA receptor) is also 
critical for remote memory: an inducible and reversible NMDAR knockout was used 
to show that the retention of a 9-month fear memory is severely disrupted by extended, 
but not brief, loss of the NMDA receptor. Although this temporary disruption of NMDA 
receptors affected the prior remote memory, once completed it did not block the forma- 
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tion of new memories, suggesting that the blocking of remote memory was not caused 
by irreversible damage to cortical networks caused by the temporary loss of these 
receptors. Thus, it is possible that NMDA receptor-dependent activation of aCaMKII 
in neocortex is required for remote memory. NMDA receptors and aCaMKII are also 
known to be critical for both cortical LTP and for experience-dependent plasticity in 
the visual and somatosensory cortex (Fox 2002; Taha and Stryker 2005), suggesting 
that NMDAR-dependent activation of «CaMKII is a critical mechanism for cortical 
plasticity processes involved in both sensory plasticity and remote memory. 


Mechanisms of structural plasticity: 
a possible role in remote memory 


Previous studies demonstrate that «CaMKII messenger RNA is targeted to dendrites 
and that both the kinase and its activity are enhanced by synaptic activity (Soderling 
2000). Elegant studies in the Xenopus retinotectal system demonstrated that this kinase 
has a direct modulatory role in the development of neuronal circuits (Cline 2001). These 
studies indicate that natural increases in aCaMKII expression during development 
cause a decrease in both new branch additions and retractions, thus stabilizing newly 
formed synapses (Wu et al. 1999; Cline 2001). But, is there any evidence that aCaMKII 
may also have a role in synaptic restructuring during cortical memory processes? 

Remote memory retrieval leads to increases in the levels of GAP-43 in regions 
required for remote memory (Frankland et al. 2004; Maviel et al. 2004). GAP-43 is 
a synaptic protein with a role in synaptogenesis, synaptic plasticity and remodeling, as 
well as learning and memory (Benowitz and Routtenberg 1997). Thus, this increase in 
GAP-43 in specific neocortical areas during remote memory is consistent with the idea 
that remote memory processes are associated with synaptogenesis and synaptic remod- 
eling (Wiltgen et al. 2004; Frankland and Bontempi 2005). Interestingly, the aCaMKII 
heterozygous mutants do not show the cortical increase in GAP-43 correlated with 
remote memory, an observation consistent with the hypothesis that remote memory 
may involve xCaMKII-dependent formation of new synaptic connections (Frankland 
et al. 2004). Findings in the somatosensory cortex are also consistent with the idea that 
aCaMKII modulates synaptic remodeling in cortical networks (Fox 2002). 


Imaging cortical plasticity, imaging remote memory 


Previous studies (Trachtenberg et al. 2002) used in vivo, time-lapse two- photon laser 
scanning microscopy (2P LSM) imaging to repeatedly study individual neurons in the 
barrel cortex of mice over many days and found that, although dendritic structure is 
stable, spine lifetimes vary greatly. A significant percentage of the spine population 
seems stable (lasting at least a month), whereas the rest appear to be far less stable (it is 
still unclear what the exact percentages are; Zuo et al. 2005a,b). Electron microscopic 
studies (Trachtenberg et al. 2002) indicated that synapse formation and elimination 
are associated with dynamic changes in spines. Importantly, experience-dependent 
changes in somatosensory function driven by whisker deprivation resulted in increased 
synapse turnover, a result consistent with the idea that storage of experience-dependent 
information drives changes in cortical spine structure (Trachtenberg et al. 2002; Zuo 
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et al. 2005a,b). Physiological studies had demonstrated that aCaMKII has a key role 
in experience-dependent plasticity in the somatosensory cortex (Fox 2002). Together 
these studies are consistent with the idea that experience remodels cortical circuits and 
that this process is aCaMKII-dependent. These data also suggest that at least certain 
aspects of remote memory processing share mechanisms with sensory plasticity. In 
both cases the cortex is being asked to integrate previous representations with new ones. 
This hypothesis suggests several interesting questions: Are spines stable in regions 
required for remote memory? Does memory consolidation alter the stability of spines 
in cortical regions required for remote memory? Do manipulations that disrupt remote 
memory affect spine stability in cortical regions required for remote memory? These 
anda host of related questions suggest an exciting research program on remote memory 
that will involve state of the art transgenic, physiological, imaging, and behavioral 
approaches. 


The immediate future of remote memory 


The studies summarized above open the door to a systematic and comprehensive study 
of the molecular, cellular, systems and cognitive studies of remote memory. Addition- 
ally, there is a significant amount of conceptual and modeling work to be carried 
out that will affect the exciting upcoming experimental work. For example, despite 
a considerable amount of contradictory evidence, ideas about later stages of memory 
processing by and large assume a gradual processing and storage of information that 
remains essentially unchanged as it matures or consolidates in the brain (McGaugh 
2000; Nadel et al., in press). Many of the conceptual frameworks used in memory storage 
have yet to integrate a growing body of data that has compellingly demonstrated that 
memory is not a process that faithfully records and stores information but an error- 
prone, dynamic and creative process that integrates new information with previous 
related semantic structures in a manner that is often more self-serving and subjec- 
tive than faithful and reliable (Schacter and Norman 1998). Interestingly, our growing 
knowledge of the molecular and cellular mechanisms of memory, including remote 
memory, has unraveled a series of dynamic and probabilistic molecular and cellular 
mechanisms that often lack the faithful and reliable character that is so often assumed 
in studies of memory “consolidation”. The very term “consolidation” implies a stability 
and permanence that memories rarely have. Indeed, recent studies have suggested that 
even the very process of recollection can change and alter supposedly “consolidated” 
memories (Nader et al. 2000; Sara 2000). In view of the surprising plasticity of acquired 
memories, one wonders whether the term “consolidation” has outlived its usefulness. 
Thus, it will be important to explore the character of remote memory with paradigms 
that more closely reflect the nature of remote memory. Although overtraining, strong 
emotional content, and simplicity can create memories that appear highly reliable 
and stable, real-life learning and ethologically meaningful memories rarely have these 
properties. Instead, more often than not memory has a tentative, imperfect and fuzzy 
character that is hardly captured in molecular, cellular and systems studies of memory, 
including remote memory. Thus, neuroscience studies should try to address the more 
interesting integrative and equivocal aspects of memory. Instead of failures of storage 
or recollection, they could potentially reflect highly adaptive and useful processes that 
remain unexplored. 
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How are remote memories stored in the brain? Are they distributed throughout the 
brain or are they stored close to the primary areas that initially processed them? Which 
regions “store” information, which regions organize retrieval, and more importantly, 
which circuits oversee the integration of new information into pre-established seman- 
tic structures? These are just a small sample of the vast array of cognitive neuroscience 
questions that need to be addressed concerning the neuroanatomy and processing of 
remote memory. In these studies, it will be important to not only image these processes 
but also to take full advantage of opportunities to study the impact of disruptions of 
these processes. Although often plagued with pleiotropic problems, human genetic dis- 
orders that affect cognitive function, from neurological to psychiatric conditions, offer 
precious opportunities to study brains with molecular and developmental dysfunction 
that may aid the cognitive neuroscience of memory, including remote memory. For 
example, rodent studies suggested a role for the prefrontal cortex in remote memory. It 
would be useful and informative to take advantage of the myriad of genetic conditions 
that disrupt pre-frontal function to evaluate the role of this structure in remote mem- 
ory. It may not be sufficient to search for losses of remote memory among this varied 
patient population; instead it may be more productive instead to evaluate the nature, 
content and character of remote memories in these subjects. 

Similarly, animal model studies in non-human primates, rodents, etc., also need 
to explore not only the neuroanatomy of remote memory but also the nature and 
behavioral characteristics of this form of memory (Nadel et al., in press). For example, 
recent studies suggest that in rodents, as in human subjects, remote memories lack the 
specificity and greater faithfulness of earlier memories (Wiltgen et al., 2006). It seems 
that, with time, memories become more general and less stimulus bound. These and 
other properties of remote memory need to be investigated and understood. It is clear 
that much needs to be done and that the studies summarized above have barely opened 
the door to this vast, but fascinating problem. 
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Post-Activation State: 
A Critical Rite of Passage of Memories 


Yadin Dudai! 


Summary 


Each memory item has a unique biography. The global outline of the narrative of 
this biography, however, is shared by different items. The textbook account charts the 
ontogeny of memory items as a universal linear process. First the new information is 
encoded. Then it enters a short-term persistence phase, during which it is prone to 
interference by various types of treatments, ranging from distracting sensory stimuli 
to physical agents and drugs, collectively termed “amnesic agents”. The period during 
which the item becomes gradually immune to the effect of amnesic agents is termed 
“memory consolidation”. Upon completion of consolidation, so goes the zeitgeist, the 
stabilized item enters into a long-term “store”, from which it can be later retrieved 
for use. 

The textbook account may well be wrong. First and foremost, many lines of evidence 
suggest that “storage” is a misguided metaphor and that items in long-term memory are 
not stored but rather reconstructed each time they are retrieved (Bartlett 1932; Tulving 
1983; Schacter 2001). Furthermore, this reconstruction might generate an internal 
representation that fails to faithfully replicate the representation of the original event 
(Bartlett 1932; Loftus and Loftus 1980; Schacter 2001). But even more relevant to 
the present discussion is the assumption that memory consolidates just once per item. 
Again and again, data have been reported that could be interpreted to indicate that this is 
not the case and that long-term memory items “reconsolidate” upon their reactivation. 
The latter hypothesis has been revitalized in recent years (Nader et al. 2000; Sara 2000; 
Nader 2003; Dudai 2004). The term “reconsolidation” itself is probably a misnomer, 
since whatever process the trace undergoes after its reactivation, faithful recapitulation 
of the original consolidation it is not. But few will argue with the data: something special 
and interesting happens to long-term memories following their use (Dudai 2006). 
Understanding what this “interesting thing” is, is highly relevant to our understanding 
of the nature of memory in general. Furthermore, what happens is not only of interest 
to armchair contemplation of memory: it may have important clinical implications, 
such as the ability to ameliorate post-traumatic memories (e.g., Miller et al. 2004). 

The post-retrieval fate of items in long-term memory is the focus of this chapter. 
First I will briefly present data from an experimental system that has been studied in our 
laboratory for several years now and offers advantages for the study of consolidation 
as well as of post-retrieval processes. This system is conditioned taste aversion (CTA) 
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in the rat. I will then proceed to propose that to properly construe the data on the 
formation, stability and post-retrieval fate of long-term memory items, we might wish 
to postulate the existence of a special memory state, which I will call the post-activation 
state (PAS). This state, so goes the claim, plays a critical role in the life of memories. 


The fates of the CTA trace 


CTA is the learned association of taste with visceral distress (Garcia et al. 1955; Bures 
et al. 1998; Rosenblum et al. 1993). In the typical protocol employed in our laboratory 
(Rosenblum et al. 1993), the rat is presented with a pipette containing a solution of 
an unfamiliar tastant, such as saccharin or glycine and, an hour or so later, is injected 
i.p. with a solution of LiCl, which produces visceral malaise for about two hours. 
In the test, usually performed a few days later, the rat is presented with a choice of 
two pipettes, one containing the malaise-associated tastant, the other a familiar, non- 
conditioned tastant. Conditioned rats prefer the non-conditioned over the conditioned 
tastant. In terms of classical (Pavlovian, elemental) conditioning, one might consider 
the conditioned tastant to be the conditioned stimulus (CS), the malaise-inducing agent 
to be the unconditioned stimulus (UCS), and taste rejection to be the conditioned 
response (CR). Many tastants can be used as CS, and other aversive agents, such as 
anxiogenic drugs, can be substituted for the LiCl as the UCS (Guitton and Dudai 2004). 
In contrast with other protocols of classical conditioning, which do not tolerate a CS- 
UCS interstimulus interval (ISI) of longer than a few seconds, CTA can tolerate an ISI 
of up to a few hours, implying that one can use the paradigm to investigate incidental 
learning (that of the CS in isolation, i.e. the formation of a novel taste memory) in 
addition to associative learning. 

CTA is readily obtained after a single training trial, and the resulting memory lasts 
for many months (Berman et al. 2003) if not for a lifetime. However, if tested repeatedly 
in the absence of the UCS, memory extinguishes rather readily (Berman and Dudai 
2001). CTA memory can, nevertheless, be rendered quite resistant to extinction if two 
training trials, 24 hours apart, are used instead of a single training trial (Berman et al. 
2003; Eisenberg et al. 2003). Thus CTA is fit to be used in the investigation of fast 
incidental and associative encoding, long-term memory, and extinction. 

Since CTA is readily induced in the laboratory rat, it can be analyzed at multiple 
levels, ranging from the behavioral to the molecular, or vice versa. In our laboratory, 
we use a combination of methods that span such multiple levels of analysis, ranging 
from behavioral studies, via lesions (both ablations and cytotoxic), cellular recordings 
in the behaving rat, targeted pharmacology (by microinfusions of selected agents into 
identified brain areas), and biochemistry and molecular biology of selected brain areas 
(Rosenblum et al. 1993, 1995, 1996, 1997; Lamprecht and Dudai 1995, 1996; Naor and 
Dudai 1996; Lamprecht et al. 1997; Berman et al. 1998, 2000; Berman and Dudai 2001; 
Bahar et al. 2003, 2004a,b; Desmedt et al. 2003). Most of our studies so far have focused 
on two major stations in the central pathways that subserve CTA: the taste cortex in 
the insular cortex (abbreviated IC), and two amygdalar nuclei, the central (CeA) and 
the basolateral (BLA) amygdala (Fig. 1). 

All in all, our data support a flowchart model for the formation of the novel taste 
memory in the context of the CTA paradigm, with the following major attributes (Fig. 2): 
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Central gustatory area, insular cortex 


Fig. 1. The central taste system in the rat, which subserves taste learning and memory. Infor- 
mation from the buccal cavity flows via the cranial nerves (VII, IX and X) to the nucleus of 
the solitary tract (NTS), the parabrachial nucleus (PBN), and then via multiple pathways, either 
directly, or via the thalamus (ventroposteriomedial nuclei, VPM) or the amygdala (Am), to the 
central gustatory area (CGA) in the insular cortex (IC). LH, lateral hypothalamus 


on-line information about a tastant converges on the IC. The incoming stimulus also 
activates neuromodulatory systems, specifically the cholinergic and the noradrener- 
gic, that feed onto the IC. The incoming taste information (i.e. the CS information) 
is compared in the brain with off-line taste information, stored in the IC as well. The 
comparator circuit, which functions on a sub-second level, has not yet been identified, 
though it is likely to reside in the IC itself. If a match is detected, i.e., the tastant is 
deemed as familiar and the comparator circuit abates the activation of the aforemen- 
tioned neuromodulatory systems. In the presence of a mismatch, i.e., unfamiliar taste, 
the neuromodulatory systems proceed to trigger a set of intracellular cascades in the IC, 
involving, among others, mitogen-activated protein kinases (MAPK), cAMP-responsive 
binding elements proteins (CREB) and their downstream substrates such as Elk1. This 
process culminates in modulation of gene expression and long-lasting synaptic and 
circuit modifications, which are assumed to embody long-term memory. The default 
reaction of the brain to a tastant is thus to consider that tastant unfamiliar and encode 
its attributes. This makes sense: unfamiliar tastes may kill, and false negatives are much 
more risky than false positives. Two additional points are noteworthy concerning this 
model. First, it doesn’t specify where the CS-US association takes place. The association 
sites in CTA are still a matter of debate (e.g. Bures et al. 1998). There is ample evidence, 
however, that the IC contributes to the formation of the association, and, furthermore, 
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Fig. 2. A flowchart model of the acquisition of long-term taste memory. On-line information 
about a tastant converges on the IC. The stimulus also activates the cholinergic (ACh) and the 
noradrenergic (NA) neuromodulatory systems, which feed onto IC neurons. The incoming in- 
formation is compared with off-line taste information, stored in the IC as well (novelty detection 
circuit). If a match is detected, the comparator abates the activation of the neuromodulatory sys- 
tems. In the presence of a mismatch, i.e., unfamiliar taste, these neuromodulatory systems proceed 
to trigger a set of intracellular cascades in the IC, involving, among others, mitogen-activated pro- 
tein kinases (MAPK, ERK1/2), other protein kinases (PKs), cAMP-responsive binding elements 
proteins (CREB) and their downstream substrates such as Elk1, and culminating in modulation 
of gene expression and long-lasting synaptic and circuit modifications, which are assumed to 
subserve long-term memory. The default reaction of the brain to a tastant is to consider it un- 
familiar and encode its attributes. Glu, glutamate; mAChR, muscarinic acetylcholine receptor; 
NMDA, N-methyl-D-aspartate glutamatergic receptor; A/K, AMPA/Kainate glutamatergic recep- 
tors; mGluR, metabotropic glutamatergic receptors; DA, dopamine; D1,5, dopaminergic receptors; 
BAD, B-adrenergic, receptors PKs, protein kinare systems; CB1, cannabinoid receptor 1; LTM, 
long-term memory. The cellular location of the receptors in the scheme is merely for illustration 


to its storage over time, once it has been formed (Stehberg & Dudai and Shema Dudai, 
in preparation). Second, given the critical importance of CTA for survival, it appears 
that evolution has embedded backup circuits in the brain so that some reduced level 
of CTA can be obtained even in the absence of IC; the IC, nevertheless, plays a major 
role in encoding and retaining CTA memory in the intact rat. 

So much for the encoding of long-term taste memory. But what happens when the 
memory is retrieved? As noted above, upon retrieval in the absence of the reinforcer (i.e., 
the UCS), a single-trial encoded CTA memory starts to extinguish. Ample evidence, 
obtained from many different systems, indicates that extinction is relearning of a CS- 
noUCS association rather than unlearning of the CS-UCS association (Pavlov 1927; 
Rescorla 1996). This finding immediately indicates that, in the retrieval situation, at 
least two traces are playing a role: the original, CS-UCS association, and the CS-noUCS 
association. We have identified additional traces that play a role as well, for example, the 
trace of the retrieval experience itself: recalling something bad is itself a bad experience 
(Berman etal. 2003). In addition, shortly after retrieval, the original CS-UCS association 
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may regain control over behavior, in a process aptly termed “spontaneous recovery”. 
The post-retrieval period seems to be characterized by the expression of multiple traces 
that compete over the control of the organism’s behavior (Berman et al. 2003; Eisenberg 
et al. 2003; Fig. 3). 

Memory consolidation windows are commonly delineated by testing the suscep- 
tibility of the memory trace to amnestic agents immediately after encoding. A widely 
used amnestic agent is the protein synthesis inhibitor, anisomycin. When microinfused 
into the IC within the first hour or so after training, but not afterwards, anisomycin 
blocks subsequent long-term CTA (Rosenblum et al. 1993). Short-term CTA is unaf- 
fected by this treatment. The question arises, since memory extinction is assumed to 
represent new learning rather than unlearning, will it also be blocked by microinfusion 
of anisomycin into the IC immediately after the extinction training (i.e., the retrieval 
trial in the absence of the reinforcer)? The answer is yes (Berman and Dudai 2001). 
Extinction, like the original association, consolidates. 

But the extinction trace is not the only trace that can be blocked by the amnes- 
tic agent immediately after the retrieval session. As noted above, long-term CTA that 
is induced after two training trials (intensive training), 24 hrs apart, is rather resis- 
tant to extinction. Whereas application of anisomycin into the IC after retrieval of 
the regular training trace blocks extinction, the same treatment after retrieval of the 
intensive training trace results in amnesia of the original trace, rather than of the ex- 
tinction trace. In other words, if the extinction trace controls behavior after retrieval, 
it becomes susceptible to the amnestic agent, whereas if the original trace controls 
behavior, it is this trace that becomes susceptible to the amnestic agent. Therefore, 
the susceptibility of the trace to the amnestic agent after retrieval is inversely pro- 
portional to trace dominance, where dominance is the ability of the trace to control 
behavior (Eisenberg et al. 2003; Fig. 4). This competition, observed at the behav- 


Fig. 3. Multiple memory traces become ac- 
tive following a non-reinforced retrieval of 
CTA memory. Among these traces are the 
expressed original CS-UCS association (ex- 
citatory trace, @), the new, CS-noUCS as- 
sociation (extinction, inhibitory trace; @) 
and the memory of the retrieval experi- 
ence (©). In addition, other associations of 
the retrieval cue may become active, and 
the original CS-UCS association sponta- 
neously recovers over time in the absence 
of new taste input (O). The shape of the ex- 
pressed traces in the figure is merely for the 
Time sake of illustration 
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Fig. 4. Inverse correlation between trace dominance and post-retrieval sensitivity to an amnestic 
agent. The graphs show the effect of microinfusion of the protein synthesis inhibitor anisomycin, 
which is an amnestic agent, into the IC. Left panel: After a single CTA training trial (regular 
training), the trace extinguishes readily (-ani). Microinfusion of anisomycin into the insular 
cortex immediately after retrieval (arrow), under conditions that block consolidation of the 
original trace (Rosenblum et al. 1993), blocks extinction (+ani). Right panel: After two training 
trials, 1 day apart (intensive training), the trace becomes much more resistant to extinction 
(-ani). The effect of microinfusion of anisomycin into the insular cortex immediately after 
retrieval (arrow) is now reversed; the protein synthesis inhibitor leads to apparent amnesia of 
the original association (+ani). (Adapted from Eisenberg et al. 2003) 


ioral level, may arise from competition of traces over shared plasticity resources at 
the cellular level. Competition over protein-synthesis-dependent plasticity resources 
has recently been demonstrated in a popular cellular model of plasticity, long-term 
potentiation (LTP), in the hippocampus (Fonseca et al. 2004). It is noteworthy that 
not all the post-retrieval traces are susceptible to the amnestic agent; the trace re- 
sulting from the CTA-retrieval experience is unaffected by anismoycin (Berman et al. 
2003). 

The post-retrieval CTA data are in line with the reconsolidation hypothesis, namely, 
that the reactivated long-term trace may have to undergo consolidation anew (Mis- 
anin et al. 1968; Sara 2000; Nader et al. 2000; Nader 2003; Dudai 2004). These data, 
however, do not prove that reconsolidation indeed takes place, let alone that it is re- 
capitulation of consolidation. Ample evidence indicates that, in CTA, the molecular, 
circuit and behavioral signatures of the reconsolidation process differ from those of 
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consolidation, and also from those of extinction (Bahar et al. 2004a; Kobilo, Stehberg 
and Dudai, unpublished). Differences between reconsolidation and consolidation have 
been demonstrated in other systems as well (reviewed in Dudai 2004, 2006). Indeed 
it would have been preferable to drop the term reconsolidation all together. However, 
this term has gained ground in the literature and is impractical to uproot - not unlike 
“memory extinction”, which is not extinction at all. 

The story is even more intriguing. Whereas consolidation has been detected in 
every species and long-term memory protocol analyzed so far, reconsolidation is not 
universal and appears to be constrained by boundary conditions, of which trace domi- 
nance (see above) is only one. In some systems, another boundary condition is the age of 
memory: older memories are rendered less amenable to reconsolidation (Suzuki et al. 
2004; Eisenberg and Dudai 2004). Still another boundary condition is a mismatch be- 
tween what the animal expects and what actually occurs; such a mismatch was reported 
to promote reconsolidation (Pedreira et al. 2004). Engagement of a memory-encoding 
mode in the reactivation session was also suggested to promote reconsolidation (Mor- 
ris et al. 2006). It is noteworthy that the mismatch of the expected and the actual, i.e., 
surprise, drives encoding (Rescorla and Wagner 1972). 


Post-Activation State (PAS) 


The data on memory reconsolidation, of which the CTA data are only an example, are 
consistently evaluated in the literature within the context of the conceptual framework 
of the dual-trace hypothesis, which assumes the existence of two major memory phases 
(short and sensitive to amnestic agents, long and stable), and of consolidation, which 
converts the ephemeral trace into the seemingly secure world of engrams. My opinion is 
that this discussion is not very productive and reinforces mental stagnation, confined 
by the zeitgeist (for a similar attitude, see Nader et al. 2005). Paradigms deserve to 
be questioned occasionally. The current flare of data on the post-retrieval fate of the 
memory trace provides a proper opportunity to do just that. These data question the 
currently dominant dual-trace and consolidation (DTC) paradigm. 

Provoked by the data and by earlier questioning of the dual-trace hypothesis (Lewis 
1979) and of consolidation (Nadel and Moscovitch 1997), I propose recontemplation of 
the DTC paradigm. I do not consider reconsolidation to be recapitulation of a consol- 
idation process; rather, it is a manifestation of a more general state in which the trace 
exists whenever it phases out of an activity period. What we consider reconsolidation 
is thus a temporarily altered state of the trace following memory reactivation. This 
post-activation state (PAS) is unveiled by increased sensitivity to amnestic agents, such 
as inhibitors of macromolecular synthesis (Dudai 2006). The increased sensitivity is 
probably a manifestation of enhanced plasticity of the neuronal circuit that encodes the 
memory trace or parts of it. Similarly, what we consider consolidation is a temporarily 
altered state of the trace following memory activation, again characterized by increased 
sensitivity to amnestic agents because of enhanced plasticity. The rule is, whenever the 
trace is active, it enters a state that is different from its inactive state. Reconsolidation 
and consolidation are different tokens of the same type of state, PAS. Seen this way, the 
trace doesn’t exist in short- and long-term states but always alternates between active 
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and inactive states (Lewis 1979). Active states occur in encoding (which in real-life in- 
volves retrieval of previous knowledge, as no brain is a tabula rasa), in retrieval (which 
in real-life involves encoding, as no retrieval replicates exactly the encoding state and 
context), and probably in background processing and maintenance of traces, such as 
in sleep (Siapas and Wilson 1998). PAS is a recurrent rite of passage of the engram, 
occurring whenever there is a transition between activity and inactivity of the circuit 
that encodes the internal representation. The circuit is never the same again. 


What PAS is, what triggers it, and what is it for 


What is the proposed PAS? The basic idea is that a surge of coherent activity in the neu- 
ronal circuit that encodes the internal representation relevant to the encoded, processed 
or retrieved item triggers a state of plasticity that is different from the period in which 
the representation is inactive. The fact that activity triggers plasticity is, of course, the 
keystone of the entire discipline of molecular neurobiology of learning, which rests 
on the assumption that teaching stimuli, encoded in action potentials or transmitters 
or neuromodulators, trigger synaptic and cell-wide plasticity mechanisms, which cul- 
minate in synaptic change (Kandel et al. 2000; Dudai 2002a). Here the expanded or 
added assumptions are: 1) activation of a representation induces a state of plasticity 
in critical nodes of the circuit that encodes that representation, regardless of whether 
the trigger for activation is external (i.e., sensory) or internal (i.e., other circuits in the 
brain); 2) this happens not only in encoding but also in retrieval and possibly also in 
implicit processing of the representation; 3) this state of plasticity outlasts the period 
of representational activity, yet terminates within minutes to a few hours at most; and 
4) despite 1 and 2 above, differences are expected in the fine details of the induced 
plasticity because different states (e.g., encoded in global rhythms; Hasselmo et al. 
2002; Csicsvari et al. 2003) allow the brain to distinguish between encoding, retrieval 
and processing. Since windows of enhanced plasticity disturb homeostasis (Dudai 
2002a) and set in motion intracellular signal transduction cascades and translational 
control mechanisms, the system is likely to display modified sensitivity to stimuli 
and metabolic blockers. This transient modified sensitivity is assumed to provide the 
biochemical basis for the transient effectiveness of amnesic agents. Because different 
amnestic agents target different metabolic processes in the cell, one should expect the 
timing of the windows of sensitivity to amnestic agents to depend on the agent used, 
and this is indeed the case (Dudai 2004). 

What is the biological function of PAS? First and foremost, one could question the 
question: assuming a role is the default of the adaptionist paradigm, which posits that 
biological phenomena always have phylogenetically selected roles (Gould and Lewontin 
1979). It is still possible that PAS exists because the biological system cannot function 
otherwise, for example, because intracellular molecular networks may not be able 
to switch among (relatively) stable states instantaneously. But suppose the adaptionist 
point of view is valid. An appealing possibility is that PAS allows the system to better in- 
tegrate the new information into the knowledge base of the organism, or update it. There 
is no lack of candidate cellular correlates for such integration, among them synaptic tag- 
ging (Frey and Morris 1997) and synaptic capture (Martin et al. 1997), all of which show 
that during the first minutes or hours after activation, neurons are particularly tuned 
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to their recent history, integrate and redistribute inputs. In fact, the morphological re- 
modeling observed in some circuits following activation may represent the formation 
of associations and retrieval links, a must for efficient usability of information, rather 
than stabilization of the encoded new representation per se (Dudai 2004; Dudai and 
Eisenberg 2004). Another possibility is that PAS serves to strengthen traces that are 
used recurrently, because it is important that they remain functional over time (for an 
example of a cellular mechanism that might permit just that, see Tronson et al. 2006). 

Seen this way, PAS is one of the three critical rites of passage of the engram. The 
first is encoding, in which the engram is born. The second is retrieval, in which the 
engram is expressed. The third is PAS, in which the engram is updated. 


Some repercussions of the PAS hypothesis 


On static vs. dynamic models of memory 


Hebb, the most influential guide of modern dual-trace models, presented a dynamic 
universe of neuronal ensembles that encode internal representations (1949), but sub- 
sequent accounts of the Hebbian universe too often tended to depict the consolidated 
long-term trace as a relatively immutable mnemonic unit, falling for the lures of erro- 
neous “storage metaphors” of memory persistence (discussed in Roediger 1980; Dudai 
2004). The PAS hypothesis clearly argues for a dynamic view of long-term memory, 
in which internal representations are in continual flux; furthermore, it argues that the 
magnitude of this flux over time is proportional to the extent the representation is 
used. It is noteworthy that, in theory, flux doesn’t necessarily imply significant change 
in content. Consider, for example, the classical Ship of Theseus problem, sometimes in- 
voked in discussions of memory persistence (Dudai 2002a,b). The ship of the mythical 
Greek hero was placed on display in Athens, and with time, parts of it were replaced, 
one by one, until none of the original remained. Is this still the same ship? The same 
applies to other constructs, for example, the Pont Neuf in Paris, in which nota stone in 
the current structure remains in place from its original construction (Jones 2006). In 
both cases, however, flux retained the original structure. The PAS hypothesis, similarly 
to prominent cognitive accounts of memory (Loftus and Loftus 1980; Schacter 2001), 
assumes something more, namely, that the flux changes not only the physical substance 
but also the content and shape. 


On metaphors 


Some proponents of consolidation seem to account for the observed lingering sensi- 
tivity of declarative long-term memory items to amnesic agents by postulating that 
consolidation lingers for many years (Haist et al. 2001; for a different interpretation 
of this type of data, see Nadel and Moscovitch 1997). This extension of consolidation 
windows over decades questions the validity of the consolidation concept, as it raises 
the possibility that consolidation never ends, hence the trace is always prone to interfer- 
ence, hence consolidation does not exist. PAS, in contrast, does not annul the idea that 
some type of consolidation occurs; it only claims that, if consolidation does occur, it 
is relevant only until the subsequent activation of the relevant internal representation. 
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Fig. 5. Post-Activation State (PAS). The neuronal population that encodes the memory enters 
a special plastic state, characterized by increased sensitivity to amnestic agents each time it 
is active and phases into an inactive state. The inactive state is the state in which the internal 
representation encoded by the neuronal population is not coherently activated and is incapable of 
controlling behavior. The trace persists in an inactive state until further activation in retrieval or 
maintenance. Alternatively, the trace may also become transiently activated in maintenance, e.g. 
sleep. The expression of PAS at any given activation period of the trace is constrained by boundary 
conditions, such as competition with other active traces over plasticity resources or engagement 
of encoding mode in the activation session (see text). PAS comes in multiple mechanistic variants, 
and is manifested in the processes we call consolidation and reconsolidation. Memory traces are 
thus considered to exist in two alternating states, active and inactive (Lewis 1979), rather than 
in short- and long-term (consolidated) states. According to this view, each item in long term 
memory undergoes three types of rites of passage, encoding, retrieval, and PAS; items never 
really consolidate, or alternatively, consolidate only temporarily, from one PAS to another 


In this respect, the PAS hypothesis can be said to convert the storage metaphor into 
a Phoenix metaphor: memories are reborn each time anew and mature only until their 
next reincarnation (Fig. 5). Metaphors in science should always be treated with caution, 
as they seldom depict reality; in the present case, whereas the mythical Phoenix dies 
only to be born anew, the mnemonic one doesn’t die, only rejuvenates. Still, metaphors 
are not merely poetic devices that facilitate the communication of ideas. They are use- 
ful in guiding our scientific as much as our poetic thinking. Therefore, in the universe 
of metaphors, the Phoenix is more useful than the storage in guiding our research 
programs. 


On working memory as a memory-updating space 


Working memory is considered to be the memory system that holds information 
in temporary storage during the planning and execution of a task (Baddeley 2000; 
Repvos and Baddeley 2006). In doing so it is postulated to ephemerally combine on- 
line (sensory) and off-line (memory) information (Dudai 2004). The PAS hypothesis 
adds another dimension, or function, to working memory, which is providing updating 
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space for long-term memory. Seen that way, working memory has two different, though 
clearly not mutually exclusive, roles: permitting the organism to combine present and 
past information for achieving an immediate goal, and enabling it to adjust knowledge 
about a changing world for future use. 


On the nature of amnesia 


Global amnesia has been portrayed over the years as a storage deficit (Squire 1987) and, 
by some, as a retrieval deficit (Warrington and Weiskrantz 1982; see also Riccio et al. 
2003; de Hoz et al. 2004). It is possible that some types of amnesia are a PAS pathology. If 
PAS is truncated, or excessively sensitive to interference, amnesia will result. This is not 
merely semantics, as it may guide potential research avenues. Similarly, spontaneous 
confabulations may represent PAS deficits, in which updating retrieved information is 
faulty (Talland 1965). 


On practical implications 


Whereas potential prevention or amelioration of some types of amnesia is a remote 
possibility, prevention or amelioration of post-traumatic stress disorder (PTSD) is not. 
The revival of the reconsolidation literature (Sara 2000; Nader et al. 2000; Nader 2003) 
has incited attempts to modify or even erase long-term traumatic memories long after 
their formation. The PAS hypothesis supports such attempts. But additional practical 
implications of the PAS notion could be incited by assimilating into neuroscience 
a notion that is so familiar to cognitive psychologists, namely, that the life span of 
an item in memory persists only from one activation to another, and that memories, 
primarily declarative ones, change upon their use. Rites of passage, in which these 
changes are likely to take place, are not only windows of opportunity to peek into the 
mechanisms of memory; they also provide potential therapeutic windows. 


Acknowledgements. | am grateful to Karim Nader, Joe LeDoux, Richard Morris, Mark Fisen- 
berg, Jimmy Stehberg, and Tali Kobilo for valuable discussions; to the Volkswagen Foundation, 
the Human Frontiers Science Program (HFSP), the Israel Science Foundation (ISF), and the Nella 
and Leon Benoziyo Center for Neurological Diseases for support; and to the Fondation IPSEN, 
Paris, for organizing a marvelous, effective and educating meeting on Memories: Molecules and 
Circuits. 


References 


Baddeley A (2000) The episodic buffer: a new component of working memory? Trends Cogn Sci 
4:417-423 

Bahar A, Samuel A, Hazvi S, and Dudai Y (2003) The amygdalar circuit that acquires taste aversion 
memory differs from the circuit that extinguishes it. Eur J Neurosci 17:1527-1530 

Bahar A, Dorfman N, Dudai Y (2004a) Amygdalar circuits required for either consolidation or 
extinction of taste aversion memory are not required for reconsolidation. Eur J Neurosci 
19:1115-1118 

Bahar A, Dudai Y, Ahissar E (2004b) Neuronal signature of familiarity in the taste cortex of the 
behaving rat. J Neurophysiol. 92:3298-3308 


80 Y. Dudai 


Bartlett FC (1932) Remembering. A study in experimental and social psychology. London: Cam- 
bridge University Press 

Berman DE, Dudai, Y (2001) Memory extinction, learning anew, and learning the new: dissocia- 
tions in the molecular machinery of learning in cortex. Science 291:2417-2419 

Berman DE, Hazvi S, Rosenblum K, Seger R, Dudai Y (1998) Specific and differential activation 
of mitogen-activated protein kinase cascades by unfamiliar taste in the insular cortex of the 
behaving rat. J Neurosci 18:10037-10044 

Berman DE, Hazvi S, Neduva V, Dudai Y (2000) The role of identified neurotransmitter systems 
in the response of the insular cortex to unfamiliar taste: activation of ERK1-2 and formation 
of a memory trace. J Neurosci 20:7017-7023 

Berman DE, HazviS, Stehberg J, Bahar A, Dudai Y (2003) Conflicting processes in the extinction of 
conditioned taste aversion: behavioral and molecular aspects of latency, apparent stagnation, 
and spontaneous recovery. Learn Mem 10:16-25 

Bures J, Bermudez-Rattoni F, Yamamoto T (1998) Conditioned taste aversion. Memory ofa special 
kind. NY: Oxford University Press 

Csicsvari J, Jamieson B, Wise KD, Buzsáki G (2003) Mechanisms of gamma oscillations in the 
hippocampus of the behaving rat. Neuron 37:311-322 

de Hoz L, Martin SJ, Morris RGM (2004) Forgetting, reminding, and remembering: the retrieval 
of lost spatial memory. PLoS Biol 2:1233-1242 

Desmedt A, Hazvi S, and Dudai Y (2003) Differential pattern of CREB activation in the rat brain 
after conditioned aversion as a function of the cognitive process engaged: Taste vs. context 
association. J Neurosci 23:6102-6110 

Dudai Y (2002a) Memory from A to Z. Keywords, concepts and beyond. Oxford: Oxford University 
Press 

Dudai Y (2002b) Molecular bases of long-term memories: a question of persistence. Curr Opin 
Neurobiol 12:211-216 

Dudai Y (2004) The neurobiology of consolidations, or, how stable is the engram? Ann Rev 
Psychol 55:51-86 

Dudai, Y (2006) Reconsolidation: the advantage of being refocused. Trends Cogn Sci 16:174-178 

Dudai Y, Eisenberg M (2004) Rites of passage of the engram: reconsolidation and the lingering 
consolidation hypothesis. Neuron 44:93-100 

Eisenberg M, Dudai Y (2004) Reconsolidation of fresh, remote, and extinguished fear memory 
in medaka: old fears don’t die. Eur J Neurosci 20:3397-403 

Eisenberg M, Kobilo T, Berman DE, Dudai Y (2003) Stability of retrieved memory: Inverse 
correlation with trace dominance. Science 301:1102-1104 

Fonseca R, Nagerl UV, Morris RGM, Bonhoeffer T (2004) Competing for memory: hippocampal 
LTP under regimes of reduced protein synthesis. Neuron 44:1011-1020 

Frey U, Morris RGM (1997) Synaptic tagging and long-term potentiation. Nature 385:533-536 

Garcia J, Kimmeldorf DJ, Koelling RA (1955) Conditioned taste aversion to saccharin resulting 
from exposure to gamma radiation. Science 122:157-158 

Gould SJ, Lewontin RC (1979) The spandrels of San Marco and the Panglosian paradigm: a critique 
of the adaptionist programme. Proc Roy Soc London B 205:581-598 

Guitton MJ, Dudai Y (2004) Anxiety-like state associates with taste to produce conditioned taste 
aversion. Biol Psychiat 56:901-904 

Haist F, Bowden Gore J, Mao H (2001) Consolidation of human memory over decades revealed 
by functional magnetic resonance imaging. Nature Neurosci 4:1139-1145 

Hasselmo ME, Bodelon C, Wyble BP (2002) A proposed function for hippocampal theta rhythm: 
separate phases of encoding and retrieval enhance reversal of prior learning. Neural Comput 
14:793-817 

Hebb DO (1949) The organization of behavior: a neuropsychological theory. New York: Wiley. 

Jones C (2006) Paris. A biography of a city. New York: Penguin Books 

Kandel ER, Schwartz JH, Jessell TM (2000) Principles of neural science. New York: McGraw-Hill 


Post-Activation State: A Critical Rite of Passage of Memories 81 


Lamprecht R, Dudai Y (1995) Differential modulation of brain immediate early genes by in- 
traperitoneal LiCl. NeuroReport 7:89-293 

Lamprecht R, Dudai Y (1996) Transient expression of c-Fos in rat amygdala during training is 
required for encoding conditioned taste aversion memory. Learn Mem 3:31-41 

Lamprecht R, Hazvi S, Dudai Y (1997) cAMP response element-binding protein in the amygdala 
is required for long- but not short-term conditioned taste aversion memory. J Neurosci 
17:8443-8450 

Lewis DJ (1979) Psychobiology of active and inactive memory. Psychol Bull 86:1054-1083 

Loftus EF, Loftus GR (1980) On the permanence of stored information in the human brain. Am 
Psychol 35:409-420 

Martin KC, Casadio A, Zhu H, Yaping E, Rose JC, Chen M, Bailey CH, Kandel ER (1997) Synapse- 
specific, long-term facilitation of Aplysia sensory to motor synapses: A function for local 
protein synthesis in memory storage. Cell 91:927-938 

Miller MM, Altemus M, Debiec J, LeDoux JE, Phelps EA (2004) Propranolol impairs reconsolida- 
tion of conditioned fear in humans. Soc Neurosci Abst 208.2 

Misanin JR, Miller RR, Lewis DJ (1968) Retrograde amnesia produced by electroconvulsive shock 
after reactivation of consolidated memory trace. Science 160:554-555 

Morris RGM, Inglis J, Ainge JA, Olverman HJ, Tulloch J, Dudai Y, Kelly PAT (2006) Reconsolidation 
of spatial memory: Differential sensitivity of distinct spatial memory tasks to local inhibition 
of protein-synthesis in dorsal hippocampus following memory retrieval. Neuron 50:479-489 

Nadel L, Moscovitch M (1997). Memory consolidation, retrograde amnesia and the hippocampal 
complex. Curr Opin Neurobiol 7:217-227 

Nader K (2003) Memory traces unbound. Trends Neurosi 26:65-72 

Nader K, Schafe GE, LeDoux JE (2000) Fear memories require protein synthesis in the amygdala 
for reconsolidation after retrieval. Nature 406:722-726 

Nader K, Hardt O, Wang S-H (2005) Response to Alberini: right answer, wrong question. Trends 
Neurosci 28:346-347 

Naor C, Dudai Y (1996) Transient impairment of cholinergic function in the rat insular cortex 
disrupts the encoding of taste in conditioned taste aversion. Behav Brain Res 79:61-67 

Pavlov IP (1927) Conditioned reflexes. An investigation of the physiological activity of the cerebral 
cortex. London: Oxford University Press 

Pedreira ME, Perez-Cuesta LM, Maldonado H (2004) Mismatch between what is expected and 
what actually occurs triggers memory reconsolidation or extinction. Learn Mem 11:579-585 

Repvos G, Baddeley A (2006) The multi-component model of working memory: Explorations in 
experimental cognitive psychology. Neuroscience 139:5-21 

Rescorla RA (1996) Preservation of Pavlovian associations through extinction. Q J Exp Psychol 
49B:245-258 

Rescorla RA, Wagner AR (1972) A theory of Pavlovian conditioning: variations in the effec- 
tiveness of reinforcement and nonreinforcement. In: Black AH, Proksay WF (eds) Classical 
conditioning II: Current research and theory. New York: Appleton-Century-Crofts, pp. 64-99 

Riccio DC, Millin PM, Gisquet-Verrier P (2003) Retrograde amnesia: forgetting back. Curr Dir 
Psychol Sci 12:41-44 

Roediger HI (1980) Memory metaphors in cognitive psychology. Mem Cogn 8:231-246 

Rosenblum K, Meiri N, Dudai Y (1993) Taste memory: the role of protein synthesis in gustatory 
cortex. Behav Neural Biol 59:49-56 

Rosenblum K, Schul R, Meiri N, Hadari Y, Zick Y, Dudai Y (1995) Modulation of protein tyrosine 
phosphorylation in rat insular cortex following conditioned taste aversion training. Proc 
Natl Acad Sci USA 92:1157-1161 

Rosenblum K, Hazvi S, Berman DE, Dudai Y (1996) Carbachol mimics the effects of sensory input 
on tyrosine phosphorylation in cortex. NeuroReport 7:1401-1404 

Rosenblum K, Berman DE, Hazvi S, Lamprecht R, Dudai Y (1997) NMDA receptor and the 
tyrosine phosphorylation ofits 2B subunit in taste learning in the rat insular cortex. J Neurosci 
17:5129-5135 


82 Y. Dudai 


Sara SJ (2000) Retrieval and reconsolidation: toward a neurobiology of remembering. Learn Mem 
7:73-84 

Schacter DL (2001) The seven sins of memory: how the mind forgets and remembers. Boston: 
Houghton Mifflin 

Siapas AG, Wilson MA (1998) Coordinated interactions between hippocampal ripples and spin- 
dles during slow-wave sleep. Neuron 21:1123-1128 

Squire LR (1987) Memory and brain. New York: Oxford University Press 

Suzuki A, Josselyn SA, Frankland PW, Masushige S, Silva AJ, Kida S (2004) Memory recon- 
solidation and extinction have distinct temporal and biochemical signatures. J Neurosci 
24:4787-4795 

Talland GA (1965) Deranged memory. A psychonomic study of the amnesic syndrome. New York: 
Academic Press 

Tronson NC, Wiseman SL, Olausson P, Taylor JR (2006) Bidirectional behavioral plasticity of 
memory reconsolidation depends on amygdalar protein kinase A. Nature Neurosci 9:167- 
169 

Tulving E (1983) Elements of episodic memory. Oxford: Oxford University Press 

Warrington EK, Weiskrantz L (1982) Amnesia: a disconnection syndrome? Neuropsychologia 
20:233-248 


Reactivation-Dependent Amnesia: 
Disrupting Memory Reconsolidation as a Novel Approach 
for the Treatment of Maladaptive Memory Disorders 


Jonathan L.C. Lee! and Barry J. Everitt! 


Summary 


The persistence of memories reflects a dynamic, rather than a stable, process. The reac- 
tivation of a previously learned memory, for example by re-exposure to a conditioned 
stimulus, renders it labile, such that the concomitant administration of a variety of 
amnestic agents results in a reduced ability subsequently to retrieve that memory. This 
reactivation-dependent amnesia suggests that memories may undergo cycles of reac- 
tivation followed by a process of what has been called “reconsolidation”. This review 
will discuss several of the central features of memory reconsolidation, taking a trans- 
lational view of the potential of disrupting reconsolidation as a treatment strategy for 
maladaptive memory disorders such as post-traumatic stress and drug addiction. 


It has long been observed that memories undergo a period of stabilization after their 
initial acquisition, during which time they are vulnerable to disruption (see Dudai, 
2004 for review). The use of the term memory “consolidation” (or “konsolidierung” in 
the native German) to describe this phenomenon can be traced back to the writings 
of Miiller and Pilzecker in 1900 (Lechner et al. 1999). Their studies of the learning 
and retrieval of verbal stimuli resulted in the suggestion that the “perseveration” or 
consolidation of stimuli may be necessary to strengthen the associations between them, 
and that interference with this perseveration process impairs associative learning. 
Importantly, delaying the interference resulted in a reduced disruption of subsequent 
retrieval. 

It is notable that the phenomenon described by Miiller and Pilzecker came to be 
explained within the framework of interference theory (Lechner et al. 1999). However, 
consolidation theory has persisted, with its core postulate that molecular events in the 
brain are set in train by a learning episode that takes time to stabilize the memory 
and ensure its persistence (McGaugh 2000). One important neuronal mechanism is the 
synthesis of new proteins that are hypothesized to enable the potentiation of existing 
synapses and the formation of new synapses (Bailey et al. 1996). Thus the blockade of 
protein synthesis during the post-learning period often resulted in subsequent amnesia 
(Davis and Squire 1984; Bourtchouladze et al. 1998; Schafe and LeDoux 2000). This 
retrograde amnesia exhibited a temporal gradient, similar to that observed by Miiller 
and Pilzecker, such that delay of protein synthesis inhibition rendered it ineffective in 
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causing amnesia. Therefore, central to consolidation theory is the assertion that there 
is a temporally limited window following learning during which the newly acquired 
memory can be disrupted by consolidation blockers, but after which the memory is 
stable and fixed in the brain (McGaugh 1966). 

The expression of previously learned memories is known not to be constant over 
time. Several well-established factors, including context, motivation and extinction, 
influence the retrieval and behavioural expression of memories (Bouton and Moody 
2004). However, in such accounts the persistence of the memory trace is not in question, 
consistent with the many observations that the application of consolidation blockers 
long after learning does not affect fully consolidated memories. Nevertheless, there is 
increasing evidence that the retrieval of memories results in them being once more 
vulnerable to disruption by consolidation blockers (so-called reactivation-dependent 
amnesia; see recent reviews by Nader 2003; Dudai and Eisenberg 2004; Alberini 2005). 
Early observations revealed that electroconvulsive shock, when applied many hours 
after fear learning, could result in amnesia for the fear memory, but only if rats were 
re-exposed to the training stimulus or outcome shortly before treatment (Misanin et al. 
1968; Schneider and Sherman 1968). It was thus proposed that memories, instead of 
undergoing a single episode of consolidation, may cycle between active and inactive 
states, with the transition between these states being mediated by memory retrieval 
and reconsolidation (Lewis 1979; Nader 2003). It is a matter of current debate whether 
the disruption of memory reconsolidation reflects an impairment in the storage or 
retrieval of memories (Sara 2000; Millin et al. 2001; Nader 2003), in which the phenom- 
ena of spontaneous recovery, renewal and reinstatement are often discussed. However, 
regardless of the nature of the underlying impairment, blockade of memory recon- 
solidation has recently gained much interest as a potential therapeutic approach for 
neuropsychiatric disorders in which persistent maladaptive memories play a promi- 
nent role, such as post-traumatic stress disorder and drug addiction (Przybyslawski 
et al. 1999; Nader 2003; Lee et al. 2005; Miller and Marshall 2005; Valjent et al. 2006), 
and the present review will focus on this topic. 

Given recent discussion regarding when the term “reconsolidation” should be used 
(Dudai 2006; Rudy et al. 2006), it is pertinent at this stage to stress that, for the purposes 
of this review, “reconsolidation” will be used to describe the putative process that is 
impaired when reactivation-dependent amnesia is observed subsequently. No assump- 
tions are made concerning what the amnesia might reflect, either neurobiologically 
or psychologically, or what the purpose of memory reconsolidation might be. These 
questions are briefly considered at the end of the review. 


Studies of aversive learning and memory reconsolidation 


The recent resurgence in studies of reactivation-dependent amnesia results from the 
demonstration that this phenomenon could be observed following targeted intracere- 
bral infusions of amnestic agents (Nader et al. 2000). This finding enabled the inference 
of a neurobiological mechanism underlying the amnestic effect, which had previously 
not been possible with the use of electroconvulsive shock and interference procedures. 
In particular, the use of the protein synthesis-inhibiting antibiotic, anisomycin, has 
become prevalent in the investigation of reactivation-dependent amnesia. Anisomycin 
has been infused directly into the basolateral amygdala (Nader et al. 2000; Duvarci 
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and Nader 2004; Debiec et al. 2006), dorsal hippocampus (Debiec et al. 2002; Lee 
et al. 2004; Power et al. 2006) and insular cortex (Eisenberg et al. 2003) of rats to re- 
sult in reactivation-dependent amnesia for discrete fear conditioning, contextual fear 
conditioning, inhibitory avoidance and conditioned taste aversion. Thus the apparent 
dependence of reactivated memories upon de novo protein synthesis has encouraged 
the view that reactivation-dependent amnesia is an important phenomenon provid- 
ing an opportunity to investigate the mechanisms of memory persistence (Dudai 2006). 


The protein synthesis-dependence of memory reconsolidation 


Reconsolidation, like consolidation, may therefore involve the synthesis of new pro- 
teins that are necessary to restabilise reactivated memories (Nader et al. 2000; Nader 
2003). However, some doubt must be expressed regarding the conclusions of studies 
that use anisomycin (Routtenberg and Rekart 2005; Rudy et al. 2006). In particular, 
anisomycin activates intracellular signaling cascades that themselves have been impli- 
cated in learning and memory, such as the mitogen-activated protein kinases (Bebien 
et al. 2003), and may also induce apoptosis (Iordanov et al. 1997). Therefore, the use 
of anisomycin alone is insufficient to establish the protein synthesis-dependence of 
memory consolidation and reconsolidation. 

The involvement of transcription factors in memory consolidation and recon- 
solidation (Kida et al. 2002; Merlo et al. 2005) provides further evidence that protein 
synthesis is a functional requirement for these processes. The protein synthesis hypoth- 
esis, however, has been tested in a more specific manner, following the demonstrations 
that specific genes are upregulated following learning or retrieval episodes (Hall et al. 
2000, 2001; Taubenfeld et al. 2001; Strekalova et al. 2003; von Hertzen and Giese 2005). 
The functional importance of the expression of these genes has been tested in rats, 
through the knockdown of their specific proteins by targeted infusions of antisense 
oligodeoxynucleotides (ASO) into the amygdala and hippocampus to impair memory 
consolidation (Guzowski and McGaugh 1997; Guzowski et al. 2000; Taubenfeld et al. 
2001; Guzowski 2002; Lee et al. 2004; Malkani et al. 2004) and reconsolidation (Lee 
et al. 2004, 2005; Tronel et al. 2005). Given that ASO do not have the same non-specific 
and diverse effects as anisomycin, these studies provide further evidence that de novo 
protein synthesis in neurons of discrete brain loci, at least in the rat, is required for 
both consolidation and reconsolidation. 

Reactivation-dependent amnesia for aversive memories has been observed across 
many other species (chicks, fish, snails, and crabs). However, many of these studies have 
involved general treatments such as anisomycin or anaesthesia (Litvin and Anokhin 
2000; Anokhin et al. 2002; Pedreira et al. 2002, 2004; Eisenberg et al. 2003; Pedreira 
and Maldonado 2003; Eisenberg and Dudai 2004; Salinska et al. 2004; Sangha et al. 
2004; Gainutdinova et al. 2005), and there are fewer reports of reconsolidation impair- 
ments with more specific treatments, such as inhibitors of NFkB (Merlo et al. 2005), 
glycosylation (Salinska et al. 2004), and antagonists of NMDA (Pedreira et al. 2002) 
and dopamine (Sherry et al. 2005) receptors. Thus it is important that studies across 
different species use a similar range of treatments to those employed in experiments on 
rodents to provide convergent evidence regarding the neurobiological basis of memory 
reconsolidation. 
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Studies of reconsolidation have mostly included control groups in which the treat- 
ment, often anisomycin, is administered without re-exposing the subject to training 
stimuli to reactivate the memory. Such non-reactivation controls are essential to show 
the reactivation-dependent effects of amnestic agents (Nader 2003; Dudai 2004). Many 
studies generally either include treated and control non-reactivated groups, analyzing 
them independently to show that there is no significant difference between treatment 
and control in the non-reactivated condition (Nader et al. 2000; Debiec et al. 2002; 
Kida et al. 2002; Kelly et al. 2003; Lee et al. 2004; Suzuki et al. 2004; Merlo et al. 2005; 
Tronel et al. 2005), or they include only the treated non-reactivated group, showing 
that it is not different from control reactivated subjects but significantly different from 
those subjects both treated and reactivated (Przybyslawski et al. 1999; Anokhin et al. 
2002; Milekic and Alberini 2002; Lattal and Abel 2004). However, to truly demonstrate 
the reactivation dependence of a treatment in comparison to control, it is necessary 
to show a significant Treatment x Reactivation interaction, where parametric statistics 
are permissible. Demonstration of a significant effect of reactivation in treated groups 
alone does not show that the effect of the amnestic agent depends upon the reactivation 
session, but only that the reactivation session itself has an effect on subsequent be- 
haviour. Moreover, the independent analysis of reactivated and non-reactivated groups 
can be criticized for the adoption of repeated testing without adequate controls. Only 
a complete analysis of all groups can lead to the conclusion that the amnestic effect 
of a treatment is dependent upon the reactivation session. Relatively few studies have 
used such a rigorous design and analysis (Eisenberg et al. 2003; Sangha et al. 2003; Lee 
et al. 2005). 

Therefore, to show the reactivation-dependent effects of amnestic agents unam- 
biguously, the appropriate statistical analysis should be conducted wherever possible. 
On the other hand, observations of reactivation-independent amnesia need not neces- 
sarily be inconsistent with the phenomenon of reconsolidation. Little is known about 
the nature of memory reactivation, and it has been proposed that this might occur 
“implicitly as well as explicitly” (Alberini 2005) or “in the absence of behaviourally 
effective retrieval, e.g., in the course of background processing or sleep” (Dudai and 
Eisenberg 2004). Thus any reactivation-independent amnesia would be difficult to 
interpret and might in fact be expected in translational studies of aversive memory 
disorders such as post-traumatic stress, which is characterized in part by the subjective 
“re-experiencing” of the traumatic event (DSM-IV). It may instead be more important 
to pursue the investigation of the specificity of the amnesia, rather than its reactivation- 
dependence, as this will be an important factor to ensure that there are not widespread 
changes to memories as a result of treatment. A recent finding is reassuring in this re- 
gard, demonstrating that only directly reactivated memories undergo reconsolidation 
(Debiec et al. 2006). 


Temporal gradients of reactivation-dependent retrograde amnesia 


The temporal gradient of the disruptive effect of amnestic agents on memory recon- 
solidation following memory reactivation have also been investigated (Przybyslawski 
et al. 1999; Nader et al. 2000; Anokhin et al. 2002; Debiec et al. 2002; Pedreira et al. 2002; 
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Salinska et al. 2004). It was found consistently that there is a limited time window during 
which reactivated memories are vulnerable to disruption, and this temporal window 
may be shorter than that following initial learning (Debiec et al. 2002). However, again 
it could be argued that the independent analysis of delayed infusion groups, which of- 
ten have smaller numbers of subjects, does not truly test for the temporal-dependence 
of the effects of amnestic agents, for which a combined analysis with an infusion time 
factor is more appropriate. Nevertheless, the consistent finding of a temporally graded 
amnestic effect strongly suggests that memories are transiently sensitive to amnestic 
agents following their reactivation. 

It can be argued further that delayed infusion controls, though undoubtedly in- 
formative, are not necessary for the demonstration of reactivation-dependent amnesia 
and hence reconsolidation impairments. Rather, they help to define the parameters 
of memory reconsolidation. The temporal gradient of retrograde amnesia following 
memory reactivation parallels that following initial memory formation. However, the 
concept of a consolidation gradient is only useful within the theoretical framework of 
stable consolidated memories. The very existence of reactivation-dependent amnesia 
instead argues for a dynamic memory system in which memories cycle between active 
and inactive states (Lewis 1979; Nader 2003). Only if the time course of transformation 
between these memory states is the focus of a study should it be necessary or worthwhile 
investigating the temporal gradients of retrograde amnesia. This information may be 
especially useful in a translational setting to ensure that any treatment is administered 
within an effective time window following memory reactivation. 

Another form of temporal gradient that is observed in studies of memory re- 
consolidation is that of a reduced amnestic effect when a greater period of time has 
elapsed between learning and reactivation-dependent treatment (Milekic and Alberini 
2002; Eisenberg and Dudai 2004). However, old memories have been seen to be dis- 
rupted following reactivation (Debiec et al. 2002). It remains unclear whether mem- 
ories generally become more resistant to reactivation-dependent amnesia with time, 
in which case it should always be possible to observe a temporally graded amnesia 
if sufficient time is allowed to elapse before memory reactivation. Alternatively, dis- 
crepant findings may be related to the different behavioural paradigms employed. 
The examples of a temporal gradient have been observed in inhibitory avoidance in 
rats (Milekic and Alberini 2002) and fear conditioning in Medaka fish (Eisenberg 
and Dudai 2004), but not with contextual fear conditioning (Debiec et al. 2002) or 
drug cue memory conditioning (Lee et al. 2006a) in rats. In either case, it will be 
important to define, in translational animal models, the temporal limitations of the 
use of reconsolidation blockade for the induction of amnesia for maladaptive memo- 
ries. 


Reconsolidation vs. consolidation 


As highlighted above, commonalities between consolidation and reconsolidation in 
aversive memories include a similar dependence upon de novo protein synthesis and 
a limited temporal window of vulnerability. Experimental investigation of memory 
reconsolidation has frequently emphasized similarities between the two processes 
(Nader 2003; Nader et al. 2005), yet there are often notable differences in both the 
neuroanatomical and molecular domains (Lee et al. 2004; Alberini 2005). Thus recon- 
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solidation may be vulnerable for a more limited period than consolidation (Judge and 
Quartermain 1982; Debiec et al. 2002), and it may also be more sensitive to disruption 
(Mactutus et al. 1979; Przybyslawski et al. 1999). Moreover, the neuroanatomical sub- 
strates of memory consolidation and reconsolidation appear to be dissociable, at least 
for some memories, such as those important for inhibitory avoidance and conditioned 
taste aversion (Taubenfeld et al. 2001; Bahar et al. 2004; Tronel et al. 2005). Finally, 
the molecular mechanisms of consolidation and reconsolidation are doubly dissocia- 
ble (Lee et al. 2004), and reconsolidation is selectively impaired both by ASO for the 
immediate early genes Zif268 (Lee et al., 2004) and C/EBP® (Tronel et al. 2005), and by 
the beta adrenergic receptor antagonist propranolol (Debiec and LeDoux 2004). It can 
certainly be argued that any difference between consolidation and reconsolidation can 
be attributed to the inherent differences in experimental protocols, though this cannot 
explain examples of double dissociation (Lee et al. 2004; Nader et al. 2005; Dudai 2006). 

Our previous finding that the molecular mechanisms in the hippocampus underly- 
ing contextual fear memory consolidation and reconsolidation are doubly dissociable 
(Lee et al. 2004) has, we suggest, several profound implications (Fig. 1). The effects of 
intra-hippocampal BDNF ASO to impair fear memory consolidation but not reconsol- 
idation, and intra-hippocampal Zif268 ASO to disrupt reconsolidation only, cannot be 
explained as non-specific effects of ASO infusion into the hippocampus, for example 
in terms of general disruption of neuronal function or by state dependency. Rather, the 
effects are oligodeoxynucleotide sequence-dependent, strongly suggesting that they 
result from the knockdown of BDNF and Zif268 protein, respectively. This finding not 
only confirms that protein synthesis is required for both consolidation and reconsoli- 
dation but also demonstrates that the cellular mechanisms of these two processes are at 
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Fig. 1. The molecular mechanisms in the dorsal hippocampus of contextual fear memory consol- 
idation and reconsolidation are doubly dissociable. Antisense oligodeoxynucleotides (ASO) for 
BDNF impaired consolidation, but not reconsolidation, when infused into the dorsal hippocam- 
pus 90 minutes before learning or reactivation. In contrast, intra-dorsal hippocampal Zif268 ASO 
selectively disrupted reconsolidation but not consolidation when infused at the same time points. 
Thus, the amnesia resulting from BDNF ASO and Zif268 ASO infusion cannot be explained by 
non-specific effects, and the cellular mechanisms of consolidation and reconsolidation are at 
least partially independent (Lee et al., 2004). MSO: control missense oligodeoxynucleotides 
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least partially independent. Therefore, it is not appropriate to describe reconsolidation 
in terms of recapitulating or partially recapitulating the consolidation process. 

The utility of investigating and defining reconsolidation by constant reference 
and comparison to consolidation has recently been called into question (Dudai 2006). 
Commonalities and differences both in neurobiological mechanisms and psychological 
parameters do not provide conclusive evidence regarding the similarity or otherwise 
of the two processes. What seem like contentious issues have arisen more from the 
unfortunate terminology with which the field has been burdened. Consistent with our 
findings of a double dissociation in the underlying hippocampal molecular mecha- 
nisms of contextual fear memory consolidation and reconsolidation (Lee et al., 2004), 
it is generally accepted that reconsolidation is not a faithful recapitulation of consol- 
idation at the mechanistic level (Alberini 2005; Nader et al. 2005; von Hertzen and 
Giese 2005; Dudai 2006). Therefore, the term “reconsolidation” should not be taken 
to imply mechanistic similarity, though the phenomenological commonalities remain 
undeniable (Nader et al. 2005). 

Regardless of any theoretical value of comparing consolidation and reconsolidation 
at molecular, cellular or systems levels, there are certainly practical advantages to such 
an approach. Much is known about the neurobiological mechanisms that mediate the 
stabilization of memory following initial learning (Dudai 2004). The agreement that 
there is at least some overlap between consolidation and reconsolidation provides 
a useful starting point for the investigation of the molecular mechanisms of memory 
reconsolidation. However, we should no longer be surprised when consolidation and 
reconsolidation mechanisms are shown to be dissociable, either singly or doubly. 
Rather, one major challenge will be to delineate those mechanisms that are specific 
to memory reconsolidation (Debiec and LeDoux 2004; Lee et al. 2004; Tronel et al. 
2005). This delineation is especially important from a translational viewpoint, as it will 
be most useful to disrupt selectively the reconsolidation of reactivated maladaptive 
memories, such as those that are important and debilitating in post-traumatic stress 
disorder, without the potential side effect of impairing new memories that are formed 
at the same time. 

The discussion of reconsolidation as a phenomenon should also not be constrained 
by the longevity and dominance of consolidation theory. The very demonstration of 
reactivation-dependent amnesia is itself inconsistent with one interpretation of the 
consolidation hypothesis, namely that memories are fixed and immutable after an 
initial period of stabilization following learning. It should not be concluded, how- 
ever, that retrieval-dependent amnesia invalidates the investigation of consolidation 
as a phenomenon; rather it questions some theories of consolidation. Consolidation 
and reconsolidation impairments resulting in amnesia are empirical findings that 
will enable a better understanding of the persistence and retrieval of memories (Du- 
dai 2006), and future theories of memory will have to account for both phenomena. 
Indeed, modified theories of consolidation have been proposed that accommodate 
reactivation-dependent amnesia (Dudai and Eisenberg 2004; Alberini 2005). These 
theoretical positions have been criticised on the basis that they result in consolidation 
as a term losing heuristic value (Nader et al. 2005). However, this is a confusion that re- 
sults precisely from the different meanings, theoretical and heuristic, of consolidation. 
There is in fact no dispute, as the word “consolidation” is being used in different ways. 
Perhaps it would be useful, when theorizing, to propose hypotheses of memory that 
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encompass both temporal gradients of retrograde amnesia and reactivation-dependent 
amnesia, thereby obviating the use of the terms consolidation and reconsolidation 
entirely. Alternatively, the terms could be used exclusively for theoretical (Rudy et al. 
2006) or empirical/heuristic meaning (Nader et al. 2005). However, the terminology is 
embedded as a useful explanatory tool in both circumstances, and it would be imprac- 
tical, at least in the short to intermediate term, to change it in a coherent manner. Most 
importantly, we must strive as authors to be absolutely clear about the way in which 
we use the terms “consolidation” and “reconsolidation”, and, as readers, not to make 
prior assumptions about how the terms are being employed. 


Studies of non-aversive and appetitive learning 
and memory reconsolidation 


The use of aversive learning paradigms in the study of learning and memory consoli- 
dation has been favored principally due to the utility of single session, or even single 
trial, learning. Similarly, investigation of memory reconsolidation has largely involved 
fear conditioning or avoidance memories. However, there is a growing literature on the 
reconsolidation of non-aversive and appetitive memories; in general, the findings are 
consistent with those of aversive conditioning studies. 

Object recognition in rats is one example of rapidly acquired non-aversive learn- 
ing, both the consolidation and reconsolidation of which appear to depend similarly 
upon protein synthesis in the ventromedial prefrontal cortex (Akirav and Maroun 
2006) and ERK signaling in the hippocampus (Kelly et al. 2003). Zif268 knockout 
mice are also impaired both in the consolidation and the reconsolidation of object 
recognition (Bozon et al. 2003), although the latter of these findings will need to be 
verified, possibly with the use of inducible knockouts that do not also have initial learn- 
ing/consolidation deficits. Reconsolidation impairments have also been observed in 
maze learning tasks in rodents (Przybyslawski and Sara 1997; Przybyslawski et al. 1999; 
Suzuki et al. 2004), as well as for odour-reward memories in both rats (Torras-Garcia 
et al. 2005) and honeybees (Stollhoff et al. 2005). Also of note are demonstrations of 
reactivation-dependent amnesia for incentive learning (Wang et al. 2005), eyelid con- 
ditioning (Inda et al. 2005), and motor learning (Walker et al. 2003), the last of which 
remains the only published demonstration to date of a reconsolidation impairment in 
humans. 

Memory reconsolidation is thus not a phenomenon that is selectively restricted to 
aversive or rapidly acquired memories. However, it may not be universally applicable, as 
reactivation-dependent amnesia has yet to be observed for instrumental and contextual 
memories (Biedenkapp and Rudy 2004; Hernandez and Kelley 2004). Nevertheless, 
much interest has been taken in the demonstration of reactivation-dependent amnesia 
for addictive drug-related memories as a potential treatment strategy for drug addiction 
(Lee et al. 2005; Miller and Marshall 2005; Milekic et al. 2006; Valjent et al. 2006). 
Reconsolidation impairments have been observed in cocaine (Miller and Marshall 2005; 
Valjent et al. 2006) and morphine (Milekic et al. 2006) conditioned place preference 
(CPP) procedures, as well as in the control of cocaine seeking by cocaine cues (Lee et al. 
2005); all of these studies were conducted in rodents. Importantly, the drug memory 
amnesia is both long-lasting (Lee et al. 2005; Miller and Marshall 2005; Milekic et al. 
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2006) and can be observed even after many pairings ofa stimulus with self-administered 
cocaine (Lee et al. 2006a). It will be important to determine further the temporal 
dynamics of drug memory reconsolidation, especially with respect to the persistence 
of the amnesia, and whether older drug memories, established during protracted 
drug-taking histories that more closely model the characteristics of drug addiction in 
humans, are also susceptible to reactivation-dependent amnesia. 


Memory reactivation 


As highlighted previously, the nature of memory reactivation is an important focus of 
current research into reactivation-dependent amnesia. It has been suggested that there 
must be some mismatch between what occurs during memory reactivation and what is 
expected following initial training (Pedreira et al. 2004). Thus reactivation sessions in 
which both the stimulus and outcome are presented (so-called reinforced reactivation) 
preclude the observation of reactivation-dependent amnesia for aversive memories 
under some circumstances (Pedreira et al. 2004; Gainutdinova et al. 2005). On the 
other hand, several studies have observed reconsolidation impairments even when 
using reinforced reactivation (Sangha et al. 2003; Duvarci and Nader 2004; Eisenberg 
and Dudai 2004). However, it is unclear whether simply using an identical session to 
that of training, for example as is commonly the case in object recognition studies 
(Kelly et al. 2003; Akirav and Maroun 2006), is sufficient to eliminate any mismatch 
between expectancy and outcome, especially as such a “prediction error” is thought 
to be important for incremental learning (Rescorla and Wagner 1972; Schultz and 
Dickinson 2000). Thus, if the mismatch hypothesis is correct, reinforced reactivation 
should only be a boundary condition (Nader 2003; Dudai 2006) on the observation of 
reconsolidation impairments if subjects have previously been trained to asymptotic 
levels of performance such that further training trials generate no prediction error. 

A further factor that has been illustrated to great effect by the drug memory recon- 
solidation studies is that, sometimes, reinforced reactivation is necessary in order to 
provide conditions under which an impairment in reconsolidation can be seen follow- 
ing amnestic treatment. Miller and Marshall (2005) observed reactivation-dependent 
amnesia for a cocaine CPP when the reactivation session consisted of a preference 
test (rats were allowed to explore both cocaine-paired and unpaired compartments). 
However, concomitant administration of cocaine was apparently required when the 
rats were confined to the paired compartment during memory reactivation (Valjent 
et al. 2006). Furthermore, morphine administration in association with a preference 
test was necessary to induce a long-lasting amnesia for a morphine CPP (Milekic et al. 
2006). These contrasting findings may be related to differences in the amnestic treat- 
ments employed (Valjent et al. 2006). Alternatively they may reflect the importance 
of the strength of reactivation, such that an increase in the strength of conditioning 
results in a greater degree of stimulus re-exposure being necessary to reactivate the 
memory (Suzuki et al. 2004). Exposure to the drug outcome may therefore be necessary 
to reactivate memories under certain circumstances, and the way in which memories 
are reactivated to enable their reconsolidation to be disrupted is a major consideration 
for both experimental and translational studies. 

Drug-conditioned place preference procedures have used three (Miller and Mar- 
shall 2005; Valjent et al. 2006) or 4 (Milekic et al. 2006) pairings of an environment 
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with non-contingent injections of cocaine or morphine. This paradigm results in 
a significant preference for the paired compartment over a different compartment 
in which the subjects received injections of saline. However, CPP can be readily ex- 
tinguished (Calcagnetti and Schechter 1993; Tzschentke and Schmidt 1995), unlike the 
effects of drug-associated cues to promote craving and relapse in addicts (Conklin and 
Tiffany 2002) and to support cocaine seeking in rats with a more extensive drug self- 
administration history (Di Ciano and Everitt 2004). Therefore, CPP may not model the 
strength and persistence of drug memories as effectively as drug self-administration 
procedures (Lee et al. 2005). It is notable that the studies of drug memory reconsoli- 
dation in rats self-administering cocaine have successfully demonstrated reactivation- 
dependent amnesia following re-exposure to a cocaine-associated cue alone, without 
the need for concomitant cocaine administration (Lee et al. 2005; Lee et al. 2006a). 
Therefore the apparent difficulty with which memories can be reactivated in the CPP 
model of drug addiction may not be inconsistent with our earlier statement that strong 
maladaptive memories might be especially vulnerable to amnestic treatment due to 
implicit or easily induced memory reactivation (see reactivation-dependence of am- 
nesia). Indeed, exposure to cocaine-related videos or drug-related paraphernalia in 
abstinent cocaine addicts readily activates limbic cortical areas and induces craving 
for the drug (Grant et al. 1996; Childress et al. 1999; Garavan et al. 2000). It remains to 
be seen whether maladaptive memories in humans, both aversive and appetitive, are 
easy or difficult to reactivate such that their reconsolidation can be disrupted, and this 
issue will be intimately associated with that of the specificity of experimentally induced 
amnesia. 


Translational applications of reactivation-dependent amnesia 


Through the course of this review, several factors have been discussed that are of 
particular relevance to any translational application of reconsolidation blockade in the 
treatment of maladaptive memory disorders. These include empirical parameters such 
as whether old and well-established memories are vulnerable to reactivation-dependent 
amnesia, and the persistence of the amnesia so-induced. Specificity of the amnesia is 
also an important issue, as it would be problematic for any therapeutic strategy if 
closely related adaptive memories were also disrupted. The reactivation-dependence 
of reconsolidation impairments is a particularly advantageous feature, as it has the 
potential to limit to a high degree of specificity the effects of amnestic agents. However, 
the ways in which memories can be reactivated successfully are poorly understood and 
require further investigation. 

The identification of tractable reconsolidation-specific neurobiological mecha- 
nisms will enable highly selective targeting of the maladaptive memories. Since the 
amnestic treatment in any future clinical application will have to be administered sys- 
temically, there should be a focus on identifying molecular mechanisms that are specific 
to memory reconsolidation. The observation that the beta-adrenergic receptor antag- 
onist, propranolol, disrupts fear memory reconsolidation is therefore of great interest 
(Debiec and LeDoux 2004). However, careful consideration must be given to the effects 
of systemically administered drugs at several neuroanatomical loci and upon other 
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memory mechanisms occurring in distributed neural systems. For example, propra- 
nolol can also impair emotional memory consolidation under certain circumstances 
(Przybyslawski et al. 1999; Cahill et al. 2000). Furthermore, whereas Zif268 expres- 
sion is required specifically in the hippocampus for the reconsolidation of contextual 
fear memories, it has also been implicated in the consolidation of contextual fear in 
the amygdala (Malkani et al. 2004), and Zif268 knockout mice are impaired in object 
recognition memory consolidation (Bozon et al. 2003). Similarly, the expression of 
C/EBPf is specifically required for the consolidation of inhibitory avoidance memories 
in the hippocampus but also for their reconsolidation in the amygdala (Taubenfeld 
et al. 2001; Tronel et al. 2005). Therefore, as discrete neuroanatomical targeting of 
systemically administered pharmacological treatments is unlikely, the potential side 
effects of amnestic treatment must be thoroughly explored. 

A final consideration relates to an alternative treatment strategy for maladaptive 
memory disorders, namely the potentiation of extinction (Davis 2002; Ressler et al. 
2004). Extinction also depends upon de novo protein synthesis, and there is a fine 
balance between whether protein synthesis inhibitors will impair extinction to pre- 
serve memories or disrupt reconsolidation to result in amnesia (Eisenberg et al. 2003; 
Pedreira and Maldonado 2003; Suzuki et al. 2004). Similarly, both extinction and re- 
consolidation can be potentiated, either to diminish memories or to strengthen them 
(Walker et al. 2002; Ledgerwood et al. 2003; Frenkel et al. 2005; Tronson et al. 2006). 
Therefore, it will be of critical importance to ensure that attempts to disrupt the re- 
consolidation or enhance extinction do not instead result in the strengthening and 
increased persistence of maladaptive memories in humans (Lee et al. 2006b). 


Concluding remarks 


The observation that reactivation-dependent amnesia occurs consistently across differ- 
ent memory systems and species, using a variety of amnestic agents, strongly indicates 
that reconsolidation is a phenomenon that must be incorporated into theories of mem- 
ory. This, however, does not address the fundamental questions of what reconsolidation 
is and what its function might be. It has been suggested that reconsolidation may enable 
memories to be restabilised (Nader 2003), updated (Dudai 2004; Dudai and Eisenberg 
2004), or further stabilised and modified (Alberini 2005). Insights into such ideas may 
be provided either through novel behavioural approaches (de Hoz et al. 2004; Tronel 
et al. 2005) or by analysis of simpler animal models with better-defined neural networks 
(Sangha et al. 2003, 2004) or cellular analogues of learning and memory (Fonseca et al. 
2006) to delineate the mechanisms of memory reactivation and reconsolidation at the 
synaptic level. Increasing our understanding of both the psychology and neurobiol- 
ogy of memory reconsolidation will, of course, enable a better characterization of the 
reactivation-dependent amnesia itself (including whether it can be best described as 
a storage or retrieval failure). However, regardless of the eventual answers to these 
questions, it is clear that the possibility of disrupting previously acquired maladaptive 
memories exists and warrants further study. 
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The Organizing Principles of Real-Time Memory Encoding: 
Neural Clique Assemblies and Universal Neural Codes 


Joe Z. Tsien! 


Summary 


Recent identification of network-level coding units, termed neural cliques, in the hip- 
pocampus has allowed real-time patterns of memory traces to be mathematically de- 
scribed, directly visualized, and dynamically deciphered. Those memory coding units 
are functionally organized in a categorical and hierarchical manner, suggesting that 
internal representations of external events in the brain are achieved not by recording 
exact details of those events but rather by re-creating its own selective pictures based on 
cognitive importance. These neural clique-based, hierarchical-extraction and parallel- 
binding processes enable the brain to acquire not only large storage capacity but also ab- 
straction and generalization capabilities. In addition, activation patterns of the neural 
clique assemblies can be converted to strings of binary codes that would permit univer- 
sal categorizations of the brain’s internal representations across individuals and species. 


Neuroscientists try to decipher the brain’s neural codes by searching for reliable cor- 
relations between firing patterns of neurons and behavioral functions (Adrian 1926; 
Gross et al. 1972; Fuster 1973; Funahashi et al. 1989; Zhou and Fuster 1996). As early 
as the 1920s, Edgar Adrian in his pioneering recording showed that the firing rate of 
a frog muscle’s stretch receptor increases as a function of the weights on the muscle 
(Adrian 1926), suggesting that information is conveyed by specific firing patterns of 
neurons. Two leading neural coding theories can be found in the literature, namely, a 
“rate code” and a “temporal code” (Barlow 1972; Van Rullen and Thorpe 2001; Softky 
1995; Eggermont 1998). In the rate code, all the information is conveyed in the changes 
of the firing of the cell. In the temporal code, information is also conveyed in the pre- 
cise inter-spike intervals. However, due to a large amount of response variability at the 
single neuron level in the brain, even in response to an identical stimulus (Bialek and 
Rieke 1992; Fenton and Muller 1998), those two types of single neuron-based decoding 
schemes often produce significant errors in predictions about the stimulus identities 
or external information. 

Changes in discharge frequency or latencies of neurons upon external stimulations 
are well known (Gross et al. 1972; Fuster 1973; O’Keefe and Dostrovsky 1971; Miller et al. 
1993; Thompson 2005; Leutgeb et al. 2005). To explain how memory might be encoded 
beyond the level of synapses, Hebb (1949) postulated that information processing 
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in the brain might involve the coordinated activity of large numbers of neurons, or 
cell assemblies. This notion, although rather vague, makes good sense both from the 
computational and cellular perspectives (Wigstrom and Gustafsson 1985; Bliss and 
Collingridge 1993; Tsien 2000; Abbott and Sejnowski 1999; Shamir and Sompolinsky 
2004; Sanger 2003). The major challenge to date has been to identify the actual patterns 
of activities of a large neuronal population during cognition and then to extract the 
network-level organizing mechanisms that enable the brain to achieve its real-time 
encoding, processing, and execution of cognitive information. 


Measuring and visualizing network-level memory traces 
in the hippocampus 


We have recently developed a large-scale ensemble recording technique in mice and 
used novel categorical episodic memory paradigms and powerful mathematical tools 
to investigate the above questions (Lin et al. 2005). The large-scale ensemble recordings 
have revealed that various robust episodic experiences, such as free-fall inside a plung- 
ing elevator (drop), a sudden gush of air to the back of an animal’s back (air blow), or 
an earthquake-like shake of the home cage (Earthquake), evoked diverse changes in 
the firing of some of the hippocampal CA1 cells (Fig. 1). 

Pattern-classification algorithms, namely, multiple discriminant analysis (MDA) 
and principle component analysis (PCA), reveal that various startle-triggered ensem- 
ble responses of CA1 neurons form distinct patterns in a low-dimensional encoding 
subspace (Fig. 2). Further application of a sliding-window technique to MDA or PCA 
methods enables us to directly visualize and dynamically monitor real-time network 
memory encoding patterns (Fig. 2A). 

Interestingly, post-event processing of newly formed ensemble patterns can also 
be directly detected and precisely quantified (Fig. 2B,C). These spontaneous reactiva- 
tions of memory traces, represented by dynamic trajectories with similar geometric 
shapes but smaller amplitudes, seem to occur at intervals ranging from several seconds 
to minutes after a discrete startling event. The existence of these reactivations sug- 
gests that the memory formation is a highly dynamic process and that they may play 
a crucial role in the immediate post-learning fixation of newly formed memory traces 
(Fig. 2B,C). Previous studies, based on the comparison of firing covariance value of 
place cells with overlapping fields between the running sessions and the post-running 
sleep period, have indicated that place cells participate in reactivations during sleep 
(Wilson and McNaughton 1994). The detection of awake-state re-activations of memory 
encoding patterns immediately following the startling events is generally consistent 
with those interpretations and further illustrates the unprecedented sensitivity of this 
new decoding method. Thus, the combination of large-scale recording and new de- 
coding algorithms begins to open a door for the direct visualization and quantitative 
measurement of network-level memory traces and their dynamic temporal evolution. 


Identification of neural cliques as real-time memory coding units 


To further identify the internal structures underlying the real-time memory encod- 
ing, we have employed the agglomerative hierarchical clustering and sequential MDA 


Memory Coding Units in the Hippocampus 101 


Safe chamber Startle chamber 


A B 
2% 
7 à 
> i Y 
Airblow Elevator drop Earthquake 
Shake 


E Safe chamber 
E Stale chamber 


Percent spent in cach chamber 


Before Training After Training 


tfs] 


Fig. 1. Large-scale simultaneous monitoring of 260 CA1 cells in mice during various mnemonic 
startling episodes. (A) A set of new behavioral paradigms that involve categorical variables is 
used to create discrete episodic memories for mice: a sudden blow of air to the animal’s back (in 
mimicking an owl attack from sky); a short vertical freefall inside a plunging small elevator, and 
an unexpected brief earthquake-like shaking of the mouse’s cage. (B) The formation of robust 
memory about the startling events can be assessed by conditioned place conditioning paradigm. 
The mice spent equal amount of time (~ 50%) between the safe chamber and the startle chamber 
during the pre-training session prior to startle conditioning. However, after startle conditioning, 
the mice spent significantly more time in the unconditioned (safe) chamber, as shown in the 
three-hour retention test (red bar, 130.1 + 5.8 s out of the total 180 seconds, p < 0.0005; numbers 
of mice: n = 14). The figure is adopted from Lin et al. (2005). A spike raster of 260 simultaneously 
recorded single units from mouse-A during a period of 0.5 s prior to and 1 s after the occurrence 
of single startling episodes of Elevator Drop is presented. (t = 0 marked with vertical red line). X- 
axis: Time scale (seconds); Y-axis: the numbers of simultaneously recorded single units (n = 260). 
The startle stimulus duration is indicated as a bar next to the vertical red line above the spike 
raster 
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Fig. 2. Classification, visualization, and dynamic decoding of CA1 ensemble representations of 
startle episodes by Multiple Discriminant Analysis (MDA). (A) Ensemble patterns during awake 
rest (dots, yellow ellipsoid), Air-blow (circles, green ellipsoid), Drop (triangles, blue ellipsoid) and 
Earthquake (stars, magenta ellipsoid) events are shown in a three-dimensional sub-encoding 
space obtained using MDA for a mouse in which 260 CA1 cells were simultaneously recorded; 
MDA1-3 denote the discriminant axes. Three representative dynamic trajectories of network 
patterns during the encoding of each type of startling event are shown. (B) Dynamic monitoring 
of post-learning spontaneous reactivations of network traces during and after the actual startling 
events. The three-dimensional subspace trajectories of the ensemble encoding patterns during 
Drop and Air-blow episodes in the same mouse are shown. The initial response to an actual 
Air-blow or Drop event (black lines) is followed by spontaneous reactivations (red and green lines 
for two air-blow reactivations, and purple line for Drop pattern reactivation), characterized by co- 
planar, geometrically similar lower amplitude trajectories (directionality indicated by arrows). (C) 
The same trajectories of reactivation traces from a different orientation after a three-dimensional 
rotation show that the trajectories are highly specific towards their own startle clusters. These 
post-learning dynamic trajectories are typically smaller in amplitude and take place without any 
time compression, and the numbers of reactivations within the initial several minutes seem to be 
in the range of one to five, with random intervals 


methods (Lin et al. 2005). These analyses reveal that the network-encoding power is 
actually derived from a set of functional coding units, termed neural cliques - a group 
of neurons with a similar response property and selectivity - in the CA1 cell popu- 
lation (Box 1). For example, the “general startle neural clique” consists of individual 
cells capable of responding to all types of startling stimuli, including the elevator drop, 
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Fig. 3. Real-time encoding robustness of memory coding units is achieved through “collective 
co-spiking” ofits individual members within each neural clique. (A) Illustration of a neural clique 
in the CA1 network. The activation function of the clique to drive the downstream clique can 
be mathematically described (equation is listed on top) as a threshold function of the integrated 
inputs from the upstream clique members, where R is input, W is the weighting factor, h is the 
threshold function (e.g., sigmoid), and t is time. (B) Spike rasters and weighted responses of the 
top 10 neurons within the Drop-specific neural clique (listed in Y-axis) during five elevator-drop 
events (1 s before and after the startle, X-axis) are shown as an example. Although responses 
of the individual member (neuron) are quite variable from trial to trial, the consistency and 
specificity of the collective co-spiking of the clique responses is evident from each Drop episode 
(five episodes are listed). The Drop-specific neural clique exhibited no significant responses to 
Air-blow or Shake episodes (last two insets on the right). Robust co-spiking of membership 
neurons in the cliques is also preserved at the finer time-scale (20-30 ms) 


earthquake, and air blow, whereas the “sub-general startle cliques” are neural groups 
that respond to a combination of two types of, but not all, startling events. In addi- 
tion, there are neuron groups that exhibit high specificity towards one specific type 
of startling event, such as elevator drop (the drop-specific neural clique), earthquake 
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Box 1. Categorical and hierarchical organization of the memory-encoding neural clique assem- 
bly. 

The major organizing principle of network-level memory encoding is that memory coding units 
are organized in a categorical and hierarchical manner (Lin et al. 2006). For each neural clique 
assembly underlying the encoding of any given startling episode, there is an invariant internal 
organization, termed the feature pyramid, with the neural cliques corresponding to the general 
features at the bottom and neural cliques corresponding to the highly specific features at the top. 
Such a feature pyramid is evident from the hierarchical clustering analysis of responses of a total 
of 757 CA1 neurons from four mice to the three different types of startling episode, which reveals 
the existence of seven major neural cliques (A): General startle clique, sub-general startle cliques 
(Drop-Shake clique, Air blow-Drop cliques, Shake-Air blow clique), startle type-specific cliques 
(Drop-specific clique, Shake-specific clique, and Air blow-specific clique), and startle context- 
specific clique (Air- blow in context A-specific clique, Air-blow in context B-specific clique, Drop 
in Elevator A-specific clique, and Drop in Elevator B-specific clique). Non-responsive units are 
grouped in the bottom half. The color scale bar indicates the normalized response magnitude 
(1 to 7). A similar cluster is also observed from the simultaneously recorded CA1 populations 
(Lin et al. 2005). 

It is clearly evident that the memory coding units are organized in a hierarchical and categorical 
fashion (B), and any given startling episode is encoded by a combinatorial assembly of a series 
of neural cliques, invariantly consisting of the general startle clique, sub-general startle clique, 
startle identity-specific clique, and context-specific startle clique. In this feature pyramid of the 
encoding clique assembly, the neural clique representing the most general, abstract features (to 
all categories) is at the bottom and it forms a common building block for all types of startle 
event encoding. The next layer of the pyramid is made up of neural cliques responding to less 
general features (covering multiple, but not all, categories); these sub-general cliques are present 
ina subset of the neural clique assemblies. As one moves up this encoding feature pyramid, neural 
cliques become more and more specific. The neural clique at the top of the pyramid encodes the 
most specific and highly discriminate features, thereby defining a particular event or experience. 
Please note that the number of neurons for each clique does not necessarily correspond to its 
position in the feature pyramid. In other words, the neural clique encoding the general features 
does not necessarily have more neurons than the neural cliques encoding more specific features. 
This invariant feature structure within each neural clique assembly encoding startling episodes 
reveals four organizing principles for memory encoding in the brain: 1) the memory system 
employs a categorical and hierarchical architecture in organizing memory coding units; 2) the 
categorical and hierarchical organization of memory coding units suggests that the internal 
representations of external events in the memory space is achieved by re-creating its own selective 
picture, a picture determined largely by what is important for survival and behavioral adaptation; 
3) through combinatorial and self-organizing processes, neural cliques can generate vast numbers 
of unique assembly patterns, thereby providing a network-level mechanism capable of encoding 
potentially infinite numbers of episodic events; and 4) the neural clique-based hierarchical 
extraction and parallel binding in the memory system can also enable higher cognition, such as 
abstraction and generalization capacities, to emerge during the process 


(earthquake-specific neural clique), or sudden air-blow events (air puff-specific neural 
clique). 

One can mathematically evaluate the contribution of these neural cliques to the 
CA1 representations by repeating the MDA analysis while sequentially adding clique 
members to an initial set of non-responsive neurons. For example, a random selection 
of 40 non-responsive cells as an initial set provides no discriminating power, yielding 
only overlapping representations (Lin et al. 2005). In contrast, inclusion of the 10 
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most responsive cells from the general startle clique leads to good separation between 
the rest cluster and the startle clusters, but not among startle clusters. The selective 
discrimination of “drop” startles is obtained by the addition of as few as 10 top neurons 
from the drop clique. Similarly, the inclusion of 10 air-blow clique and 10 shake clique 
top neurons subsequently leads to full discrimination between all startle types. Thus, 
these neural cliques indeed constitute the basic functional coding units for encoding 
the identity of different startling episodes. 

One crucial feature of neural cliques is that the individual neurons belonging to 
a given clique exhibit “collective co-spiking” temporal dynamics (Fig. 3). This collective 
co-spiking dynamics among neural clique members enables the memory coding units 
to achieve real-time, network-level encoding robustness by overcoming the response 
variability of individual neurons (Fig. 3). Moreover, based on the temporal dynamics, 
neurons within each clique can be further sub-grouped into the four major subtypes, 
namely, 1) transient increase, 2) prolonged increase, 3) transient decrease, and 4) 
prolonged decrease. The existence of four types of neurons can greatly enhance the 
real-time encoding robustness as well as provide potential means for modifying clique 
membership via synaptic plasticity. Finally, neural cliques, as network-level functional 
coding units, should also be less vulnerable to the death of one or a few neurons and, 
therefore, exhibit graceful degradation should such conditions arise during the aging 
process or disease states. 


Hierarchical organization within memory-encoding 
neural clique assemblies 


Through examining the overall organization of neural clique assembly involved in 
startle memory encoding, it is clear that the internal CA1 representations of any given 
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startle episode involves a combinatorial set of neural cliques, invariantly consisting of 
the general startle clique, sub-general startle clique, startle identity-specific clique, and 
context-specific startle clique (Lin et al. 2005, 2006). Thus, each clique assembly is orga- 
nized in a categorical hierarchy manner and invariantly consists of a “feature-encoding 
pyramid” (Lin et al. 2006; Box 1): starting with the neural clique representing the most 
general and common features (to all categories) at the bottom layer, followed by neural 
cliques responding to less general features (covering multiple, but not all, common 
categories), and then moving gradually up towards more and more specific and dis- 
criminating features (responding to a specific category), with the most discriminating 
feature clique (corresponding to context-specificity) on the top of the feature-encoding 
pyramid. 

According to this hierarchical structure of network-level memory encoding (Box 1), 
the general startle neural clique represents the neurons engaged in the extraction of 
the common features among various episodes (e.g., encoding abstract and generalized 
knowledge that “such events are scary and dangerous” by integrating neural inputs from 
the amygdala). The sub-general neural cliques are involved in identifying sub-common 
features across a subset of startling episodes (e.g., perhaps, the earthquake and drop- 
specific clique for encoding the semantic memory of the fact that “those events involve 
shaking and motion disturbances” by integrating inputs from the vestibular system), 
whereas the startle identity-specific cliques encode discriminative information about 
startle types (defining “what type” of event has happened) and the startle context- 
specific cliques provide an even more specific feature, such as contextual information 
about “where” a particular startling event has happened. 

This invariant feature-encoding pyramid of neural clique assemblies reveals four 
basic principles for the organization of memory encoding in the brain (Box 1). First, 
the neural networks in the memory systems employ a categorical and hierarchical ar- 
chitecture in organizing memory coding units. Second, the internal representations of 
external events in the brain through such a feature-encoding pyramid is achieved not by 
recording exact details of the external event but rather by re-creating its own selective 
pictures based on the importance for survival and adaptation. Third, the feature- 
encoding pyramid structure provides a network mechanism, through a combinatorial 
and self-organizing process, for creating seemingly unlimited numbers of unique in- 
ternal patterns, capable of dealing with potentially infinite numbers of behavioral 
episodes that an animal or human may encounter during its life. Fourth, in addition to 
its vast memory storage capacity, these neural clique-based, hierarchical-extraction and 
parallel-binding processes also enable the brain to achieve abstraction and generaliza- 
tion, cognitive functions essential for dealing with complex, ever-changing situations. 

The finding that the memory-encoding neural clique assembly appears to invari- 
antly contain the coding units for processing the abstract and generalized information 
(Lin et al. 2005, 2006) is interesting. It fits well with the anatomical evidence that vir- 
tually all of the sensory input that the hippocampus receives arises from higher-order, 
multimodal cortical regions and the hippocampus has a high degree of sub-regional 
divergence and convergence at each loop. This unique anatomical layout supports the 
notion that whatever processing is achieved by the hippocampus in the service of long- 
term memory formation should have already engaged with fairly abstract, generalized 
representations of events, people, facts, and knowledge. 
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The observed feature-encoding pyramid structure of the neural clique assembly is 
likely to represent a general mechanism for memory encoding across different animal 
species. For example, single unit recordings in human hippocampus show that some 
hippocampal cells fire in response to faces or, more selectively, to a certain type of 
human facial emotions; others seem to exhibit highly selective firing to one specific 
person (e.g., the “actress Halle Berry cell”, which fires selectively to her photo portraits, 
Cat-woman character, and even a string of her name (Quiroga et al. 2005). Although 
those cells were not recorded simultaneously, the findings are nonetheless consistent 
with the general-to-specific feature pyramid structure. In addition, it is also reported 
that, whereas some place cells in the rat hippocampus exhibit location-specific firing 
regardless of whether the animals engage in a random forage or goal-oriented food 
retrieval (or make a left or right turn in a T-maze), others seem to fire selectively at their 
place fields only in association with a particular kind of experience (Markus et al. 1995; 
Wood etal. 2000). Thus, those studies also seem to support the existence ofa hierarchical 
structure involved in space coding. Therefore, the hierarchical organization of the 
neural clique assembly, revealed through large-scale recordings of startling episodes 
and mathematical analyses, may represent a general feature for memory encoding in 
the brains. In addition, it further suggests that episodic memory is intimately linked 
with and simultaneously converted to semantic memory and generalized knowledge. 

This form of hierarchical extraction and parallel binding along CNS pathways 
into the memory and other high cognition systems is fundamentally different from 
the strategies used in current computers, camcorders, or intelligent machines. These 
unique design principles allow the brain to extract the commonalities through one 
or multiple exposures and to generate more abstract knowledge and generalized ex- 
periences. Such generalization and abstract representation of behavioral experiences 
have enabled humans and other animals to avoid the burden of remembering and 
storing each mnemonic detail. More importantly, by extracting the essential elements 
and abstract knowledge, animals can apply past experiences to future encounters that 
share the same essential features but may vary greatly in physical detail. These higher 
cognitive functions are obviously crucial for the survival and reproduction of animal 
species. 


Universal activation codes for the brain’s real-time neural 
representations across individuals and species 


With the identification of the neural clique as a basic coding unit and the feature- 
encoding pyramid within the clique assemblies, we can further convert (through ma- 
trix inversion) those distinct ensemble representations observed in a low-dimensional 
encoding-subspace into a string of binary activation codes with 1s and 0s (Fig. 3). This 
binary assignment - 1 for the active state and 0 for the inactive state of neural cliques - 
is based on the idea that the activity state of a neural clique can be monitored by down- 
stream neurons using a biologically plausible binary activation function (McCulloch 
and Pitts 1990; also see Fig. 3). This mathematical conversion of the activation patterns 
of the neural clique assembly into a binary code of 1s and Os creates a simple and con- 
venient way for universally comparing and categorizing network-level representations 
from brain to brain (Lin et al. 2006). 
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Fig. 3. Conversion of activation patterns of neural clique assemblies into a binary code. (A) Con- 
version of the activation state of a neural clique assembly that encodes one type of startling 
event into binary digits 1 or 0. (B) Mathematical transformation of MDA pattern into a startle 
type-specific binary encoding system. While the MDA method provides an efficient separation of 
the startle episodes, each of the discriminant axes at those MDA encoding subspaces (on the left) 
is no longer corresponding to functional meaning. Thus, we used matrix inversion to translate 
the ensemble patterns into a startle-specific encoding coordinate system (on the right). This is 
achieved by assigning new positions for the cluster centers so that they are linearly mapped into 
a “clique-space”, where each axis directly corresponds to a particular clique, thus projecting 
specific activation patterns to 1 and the absence of activation to 0 (top panel). This mathematical 
operation allow us to map the encoding subspace into one where the startle representations can 
directly correspond to neural clique activity patterns and, subsequently, to translate the collec- 
tive activity patterns of neural clique assembly into unique and efficient network-level binary 
activation codes with a string of binary digits (1’s and 0’s). (C) Conversion of activation patterns 
of multiple neural clique assemblies into real-time binary codes. Responses of neural cliques are 
illustrated in different colors. The activation function of a given clique at each network level can 
be mathematically described (Lin et al. 2005). Rows correspond to the different startling episodes, 
whereas columns indicate the different neural cliques (General startle, Air-blow, Drop, Shake, 
Air-blow context-A and Drop context-B). The binary activation patterns corresponding to each 
event can be mathematically converted to a set of binary codes (on the right column, following 
the defined sequence of the cliques). The clique activation codes are: 110010 for Air-blow in 
context A; 110000 for Air-blow in context B, 101000 for Drop in elevator-A; 101001 for Drop in 
elevator B, and 100100 for Shake. This binary code allows us to accurately predict the behavioral 
experiences by just sliding through the recorded neural population activity and calculating the 
hit ratio of matching those binary codes with the occurrences of each startling event. Part C is 
adopted from Lin et al. (2005) 


This type of the universal binary code can provide a potentially unifying framework 
for the study of high cognition. even across animal species. For example, should 
a mouse, dog, and human all experience a sudden free-fall in a plunging elevator, the 
activation patterns of the general startle neural clique, drop-specific clique, air-puff 
clique, and earthquake clique in their brains would produce the identical real-time 
binary activation code (1, 1, 0, 0), according to the above permutation and arrangement 
of the coding unit assembly. Yet, since the mouse, dog, and human may perceive other 
subtle information differently during the incident, the subsequent digits may differ. 
For example, the dog may sense a trace amount of smell of burning wires, whereas the 
human may see erratic flicking of elevator buttons, and the mouse may have a flying 
candy wrapper hit its face. Therefore, the binary activation codes would permit the 
universal measurement and categorization of neural representations between those 
three species, with the initial four digits defining the common experience of free falling 
and the subsequent digits corresponding to different subtle details. 

The proposed binary codes, derived from the activation patterns of the neural clique 
assembly, offer a concise way to universally categorize the neural representations of 
cognition in the various brains (Lin et al. 2005, 2006). In the meantime, it is important 
for us to point out the fundamental differences between the neural clique pattern- 
based brain codes and the nucleotide-based genetic codes. Specifically, the neural 
clique-based brain codes have at least four distinct properties: 1) un-inheritable: genetic 
codes are directly transferred through reproduction, whereas brain codes, by and large, 
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are not inheritable and can only be acquired through experiences (perhaps with the 
exceptions of those neural codes controlling the primitive functions, such as breathing, 
heartbeat, and knee-jerk reflex, etc., that may have been genetically programmed); 2) 
self-organizing: genetic codes act like pre-determined scaffolds, providing blueprints 
for the development and basic functionality of the organism, whereas brain codes are 
dynamic and self-organizing, arising out of internal structures and connectivity of 
neural networks upon behavioral experiences; 3) variable sizes: the numbers of genes 
are exactly fixed for each individual and species, whereas the number of brain codes 
is highly variable in each brain; in theory, it is only limited by the network capacity 
(which is determined by the convergence and divergence in connectivity), as well as the 
number of behavioral experiences that an individual encounters; and 4) modifiable: 
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unless mutated, the genetic code remains static, whereas the membership of individual 
neurons within a given neural clique is modifiable by experience-dependent synaptic 
plasticity or disease states. Thus, the above features of the brain codes are set apart in 
a fundamental way from the genetic codes. 

The identification of neural cliques as memory coding units in the hippocampus 
prompts us to entertain the concept that views neural cliques as basic, self-organizing 
processing units may be applicable to many, if not all, neural networks in the brain. 
Under this neural clique code model (Box 2), the functionality implemented by neural 
cliques in a given network depends on the specializations of the corresponding regions. 
In primary sensory regions, neural cliques in those regions (perhaps, organized in the 
forms of cortical columns) encode piecemeal information by decomposing external 
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Box 2. Neural clique code-based real-time information processes in the brain. 

Through a series of hierarchical-extraction and parallel-binding processes, the brain achieves 
coherent internal encoding and processing of the external events (Lin et al. 2006). For example, 
when a person experiences a sudden earthquake, neural cliques in his primary visual cortex en- 
code the decomposed features about edge orientation, movement, and eventually shapes of visual 
objects, whereas the neural cliques in the vestibular nuclei detect sudden motion disturbances. 
As information is processed along its pathways into deeper cortex, such as the inferior temporal 
cortex (IT), neural cliques begin to exhibit complex encoding features such as houses. By the time 
it reaches high association cortices, such as the hippocampus (HP) and temporal medial cortex 
(TMC), the neural clique assembly encodes earthquake experience and its location, with a selec- 
tive set of “what and where” information. At this level, abstract and generalized information such 
as semantic memories of “the earthquake is dangerous and scary” have emerged. As information 
is further processed into other cortical regions involving decision-making and motor planning, 
a series of phased firings among various neural clique assemblies lead to adaptive behaviors such 
as screaming and running away from the house, or hiding under a dining table 
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events into various basic features (e.g., the primary visual cortex for detecting edge 
orientation, color or size of visual objects; the vestibular nuclei for detecting motion, 
etc. Box 2). As information is further processed along its pathways into deeper regions, 
neural cliques (although no longer organized in their anatomically distinguishable 
maps or columns) start to encode more complex features (e.g., shapes and complex 
objects such as houses and faces in the inferior temporal cortex). By the time they reach 
high association cortices such as the hippocampus, neural cliques have already con- 
tained both specific and generalized mnemonic information about events, places and 
people with a significant amount of abstraction and generalization (Box 2). Eventually, 
the brain areas involved in decision-making, executive function and motor planning 
may start coherent and phased firings among various neural cliques, thereby generating 
behaviors. 

In summary, recent identification of neural cliques as the basic coding units in 
the brain has provided crucial insights into the network-level organizing principles 
underlying real-time memory encoding. Those neural cliques are self-organized in 
a combinatorial fashion to form a memory encoding assembly with an invariant hi- 
erarchical structure. This feature-encoding hierarchical structure of the neural clique 
assembly immediately suggests a network mechanism for the brain to achieve both 
large memory storage capacity and higher cognitive functions, such as abstraction and 
generalization. 
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Making and Retaining New Memories: The Role 
of the Hippocampus in Associative Learning and Memory 


Wendy A. Suzuki 


Introduction 


The groundbreaking description of the amnesic patient H.M. in the 1950s (Scoville 
and Milner 1957) demonstrated for the first time that the structures of the medial 
temporal lobe are critical for our ability to learn and retain new long-term memories 
for facts and events. This critical form of memory is referred to as declarative memory 
in humans (Squire et al. 2004) and relational memory in animals (Eichenbaum et al. 
1999). The development of powerful animal models of human amnesia in monkeys 
(Zola-Morgan and Squire 1990; Mishkin 1978; Zola and Squire 2000; Suzuki et al. 1993) 
and in rodents (Bunsey and Eichenbaum 1993,1995, 1996; Fortin et al. 2002), together 
with detailed neuroanatomical studies (Suzuki and Amaral 1994a,b; Burwell and Ama- 
ral 1998a,b), demonstrated definitively that the key medial temporal lobe structures 
important for declarative/relational memory include the hippocampus together with 
the surrounding entorhinal, perirhinal and parahippocampal cortices. While this con- 
vergence of studies in humans and animals has provided detailed information about 
the pattern of memory impairment following a wide range of lesions to the medial 
temporal lobe, less information is known about how individual cells in the intact me- 
dial temporal lobe participate in the acquisition, consolidation or retrieval of various 
forms of declarative/relational memory. 

To address this question, we have used single unit electrophysiological recording 
techniques to examine the patterns of neural activity in monkeys performing con- 
trolled, memory-demanding tasks. We have focused on associative memory, defined 
as the ability to learn the relationship between unrelated items such as the name of 
someone you have just met or the shelf in the kitchen where the teapot is stored. Im- 
portantly, this form of declarative/relational memory is known to be highly dependent 
on the integrity of the medial temporal lobe (Brasted et al. 2002, 2003; Rupniak and 
Gaffan 1987; Murray and Wise 1996; Murray et al. 2000; Wise and Murray 1999). 

Figure 1 illustrates the three major stages in the “life” of an associative memory. 
The first stage of associative memory is acquisition, when the association is encoun- 
tered for the first time and the relationship between the unrelated elements is first 
learned. During the consolidation stage, the newly formed associative memory can be 
strengthened through repetition. However, lesion studies have demonstrated that the 
memories remain dependent on the medial temporal lobe during this stage (Bayley 
et al. 2005). If the new associative information is successfully consolidated and it enters 
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Fig. 1. Stages of memory. Illustration of the three major stages of an associative memory, starting 
with acquisition, when the association is just being learned, consolidation, during which time 
the memory is susceptible to disruption, and finally the long term memory (LTM) stage, when 
the memory becomes resistant to damage to the medial temporal lobe 


long-term memory, it is no longer susceptible to disruption by damage to the medial 
temporal lobe, although this structure still may participate in aspects of representation 
or retrieval (Schacter and Wagner 1999; Schacter et al. 1995, 1996; Nyberg et al. 1996a,b). 

We have been particularly interested in the dynamic aspects of associative memory 
formation, and our first studies focused on the patterns of activity of hippocampal 
neurons during the early acquisition phase, when new associative memories are first 
being formed. These studies revealed strong associative learning signals in the monkey 
hippocampus. We next asked how these associative learning-related signals might be 
transformed as the initially novel associations became highly familiar with weeks and 
months of practice. To address this question, we examined the patterns of hippocampal 
responses to very well-learned associations and compared them to the responses to 
novel associations. Our data suggest that the hippocampus not only plays a critical 
role in signaling new associations but also that there is a strong representation of 
well-learned information in the hippocampus. 


The Neural Correlates of Associative Memory Formation 
Location-Scene Association Task 


To examine the patterns of neural activity during the formation of new associative 
memories, we recorded the activity of individual hippocampal neurons as monkeys 
performed a location-scene association task (Wirth et al. 2003). In this task, animals 
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Fig.2. Location-scene association task. In this task, following fixation, animals are shown a set of 
four identical visual targets superimposed on a complex visual image (images used in task were 
all in color). Following a delay interval, during which time the targets remained on the screen 
but the scene disappeared, the animal was cued to make an eye movement response (illustrated 
schematically by the white arrow) to one of the targets. Only one of the targets was rewarded 
for each particular scene. Animals learned by trial and error to associate each new scene with 
a particular eye movement response. Animals were also shown highly familiar “reference” scenes 
that they had seen many times before and on which they performed at ceiling levels 


learned which one of four identical targets superimposed on a complex visual scene was 
associated with reward (Fig. 2). The task started with the monkeys fixating on a central 
fixation point. They were then presented with four identical targets superimposed on 
a complex visual scene. Following a delay interval where the visual scene disappeared 
but the targets remained on the screen, the fixation spot was extinguished, which was 
the animal’s cue to make an eye movement to one of the four possible targets. Only one 
of the targets was rewarded for each new scene. Each day, animals were presented with 
a random mix of two to four new scenes (each associated with a unique rewarded target 
location) together with two to four highly familiar “reference” scenes (also associated 
with four different rewarded target locations). The new location-scene associations 
were learned in an average of 12 + 1 trial and animals performed the reference scenes 
at or near ceiling levels. 


Learning-related neural activity in the monkey hippocampus 


We hypothesized that cells signaling learning would change their activity in a manner 
that was closely correlated with the animal’s behavioral learning curve for the new 
scenes, but not for references scenes. Consistent with this hypothesis, we found that 
18% of the total population of sampled hippocampal cells (or 28% of the cells that 
responded selectively during at least one phase of the task) signaled new learning with 
dramatic changes in firing rates (Fig. 3). This changing neural activity was significantly 
correlated with the animal’s behavioral learning curve. We call these cells “changing” 
cells. Approximately half the changing cells increased their activity during either the 
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Fig. 3. Illustration of the trial by trial probability correct performance (dotted line read from the 
left axis) as a function of the trial by trial activity of cells during either the scene or delay period 
of the task (solid line read from right axis) for a sustained (left) and baseline sustained (right) 
cell. Note the strong positive or negative correlation between neural activity and learning 


scene or delay periods of the task correlated with learning, and this change in neural 
activity was sustained for as long as we were able to hold the cell (sustained changing 
cells; Fig. 3, right). Some of the sustained changing cells started out with very weak or 
no response during the early trials of the session and only developed a strong response 
as the animal learned a particular association. 

The remaining half of the changing cells responded robustly and selectivity to 
a particular scene early in the session before any association was learned. These cells 
signaled learning by decreasing their firing rate to baseline levels, and this decreasing 
activity was anti-correlated with learning (baseline sustained changing cells). The 
responses of both sustained and baseline sustained changing cells were highly selective, 
in that the changes in neural activity only occurred for a particular learned scene. To 
determine if these changes in neural activity represent a motor signal, we compared 
the response of the changing cell to the response of the same cell to the reference 
scenes with the same rewarded target location (i.e., the same motor response for both 
new and reference scenes). In no case did the changing cells respond similarly to the 
references scenes, suggesting the changing signal was not simply signaling the direction 
of movement. To determine if the changing cells signal learning specific for particular 
rewarded target locations, we recorded the activity of a changing cell during learning 
of two different new scenes with the same rewarded target location. Typically, the cell 
would change in parallel to learning for the first new scene. However, similar changes 
in activity were never seen for the second new scene with the same rewarded target 
location. Thus, hippocampal changing cells do not appear to signal new learning in 
a motor-based or direction-based frame of reference. Instead, these findings suggest 
that hippocampal cells signal fast associative learning between sensory stimuli and 
motor responses or target locations. This interpretation is consistent with theories 
suggesting that the hippocampus plays a fundamental role in forming the random 
associations or relationships between unrelated items (Eichenbaum et al. 1999). These 
kinds of simple associations may be critical to build up the more complex associations 
between the “what”, “where” and “when” information that forms the basis of episodic 
memories. 
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What does the change in neural activity represent? 


Previous studies have shown that neurons in both the perirhinal cortex and area TE 
signal long-term associations between visual stimuli (visual-visual paired associate 
memory) by responding similarly to the two items that had been paired in memory 
(Naya et al. 2003; Sakai and Miyashita 1991). These findings suggested that the learning 
of the paired associates may have “tuned” or “shaped” the sensory responses of these 
cells towards a similar response to the two stimuli paired in memory. Consistent with 
this idea, several other groups demonstrated that perirhinal neurons show a shift in 
response selectivity during the associative learning process (Erickson and Desimone 
1999; Erickson et al. 2000; Messinger et al. 2001). These findings suggested that the 
striking changes in neural activity observed during the location-scene association task 
may representa change in the cell’s stimulus selective response properties with learning. 
To address this possibility, we examined the average response of a single changing cell 
to all new scenes and reference scenes over the course of learning. The example cell, 
illustrated in Fig. 4A, did not differentiate between any of the new or reference scenes 
during the scene period of the task early in the learning session. However, this cell 
appeared to develop a highly selective response to new scene 2 (black line), which 
occurred in parallel with learning (thick gray line). To quantify this observation across 
the changing cells, we measured selectivity using a selectivity index (Moody et al. 
1998) to the responses to all new and reference scenes before versus after learning. 
We analyzed the sustained and baseline sustained changing cells separately. We found 
that, whereas the sustained cells exhibited a significant increase in selectivity (Fig. 4B) 
with learning, the baseline sustained cells exhibited a significant decrease in selectivity 
(Fig. 4C). These findings suggest that hippocampal cells signal new associations with 
a significant change in their stimulus-selective response properties. 


Timing of the changing cells relative to learning 


We next focused on understanding the timing of these neural changes relative to 
behavioral learning. If these changing cells changed slightly before behavioral learning 
was expressed, this would suggest a role in driving the learning process. Changes 
in neural activity that occurred after behavioral learning was expressed would be 
consistent with a role in strengthening of the newly formed association. To address 
this question, for each cell that changed for a particular condition, we estimated the 
trial number of learning as well as the trial number of neural change. Figure 5 shows 
a scatter plot of this comparison for all changing cells in the hippocampus. This plot 
makes several important points. First, whereas about half of the changing cells changed 
before or at the same time as learning, the remaining half changed after learning. These 
early changing cells suggest that the hippocampus may be among the earliest brain 
structures to signal or drive new associative learning. Second, these data also suggest 
that, while early learned associations (i.e., associations learned in less than about 15 
trials) tended to change after learning, neurons only started to change before behavioral 
learning if the animal took longer to learn the association. This pattern suggests the 
possibility that different behavioral (and neural) strategies may be employed early in 
the session, when the scenes are novel and there is more overall uncertainty associated 
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Fig. 4. (A) Average response to four reference scenes and two new scenes over the course of the 
recording session for a sustained changing cell. The learning curve for New Scene 2 is illustrated by 
the thick gray line. (B) Graph showing the significant increase in selectivity index for sustained 
changing cells after learning compared to before learning. (C) In contrast, baseline sustained 
changing cells decreased their selectivity after learning compared to before learning 


with them, and later in the session, when the scenes are more familiar and at least one 
or two have been learned. 

To summarize thus far, we have shown that many hippocampal neurons signal 
learning by changing their stimulus-selective response properties. While some neurons 
increase their activity with learning, others decrease their activity (and stimulus- 
selective responses), suggesting that these findings may represent an overall tuning 
of the network of hippocampal responses with learning. Because these changes occur 
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Fig. 5. Scatter plot illustrating the temporal relationship between trial number of behavioral 
change (i.e., learning) and trial number of neuronal change. Note that about half the cells change 
before or at the same time as learning whereas the remaining half of the cells change before 
learning. Moreover, there is a tendency for cells to change after learning if the scene was learned 
quickly (i.e., faster than about 15 trials) and to change after learning if the scene was learned 
more slowly 


before, at the same time as, and after learning, there may be a gradual recruitment of 
a network of hippocampal neurons throughout the early formation of new associative 
memories. 


The Neural Correlates of Well-Learned Associations 


While the previous section illustrated the dramatic changes that occur in hippocampal 
activity early in learning, we next asked if this signal might persist in some form as 
the memory is consolidated to represent very well-learned information. Previous find- 
ings have shown that cortical areas exhibited two general categories of long-lasting, 
experience-dependent plasticity. First, some cells exhibit a gradual decrease in neu- 
ral activity as an initially novel stimulus becomes familiar (Fahy et al. 1993; Riches 
et al. 1991; Li et al. 1993). Other studies have reported cortical neurons that increase 
their stimulus-selective responses to particular, well-learned stimuli (Baker et al. 2002; 
Kobatake et al. 1998; Logothetis and Pauls 1995; Sigala et al. 2002). While the hip- 
pocampus has most commonly been associated with new learning and consolidation 
of newly learned information, other studies have suggested a role in retrieval of well- 
learned information (Schacter and Wagner 1999; Schacter et al. 1995, 1996; Nyberg 
et al., 1996a,b). Indeed the role of the hippocampus in signaling information about 
well-learned information has not been examined in great detail. To address this ques- 
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tion, we examined the responses of hippocampal neurons to the well-learned reference 
scenes of our location-scene association task (Yanike et al. 2004; see also Fig. 2). In 
most cases, the animals had between 6 and 22 months of previous experience with the 
reference scenes before the cells were recorded. Thus, these reference scenes were both 
well-learned and extensively practiced. 


Selectivity for reference vs. novel scenes 


Similar to reports for other cortical areas, we found that the selectivity of the hip- 
pocampus was modulated by the extensive training with the reference scene compared 
to the new scenes. Using a selectivity index (SI Moody et al. 1998), we analyzed the 
responses of hippocampal cells with selective responses during the scene and delay 
periods of the task, comparing the selectivity to new and reference scenes. We found 
that the average SI for the well-learned references scenes was significantly higher than 
the average SI for the new scenes during both the scene period of the task and the delay 
period of the task. Thus, hippocampal cells respond significantly more selectively to 
the well-learned references scenes compared to the novel scenes. An example of an 
individual hippocampal cell with a higher SI to reference scenes compared to new 
scenes is shown in Fig. 6. This cell responded maximally to one reference scene but did 
not discriminate between the other new or reference scenes in that day’s set. 


Time course of the selective response 


To further characterize the differences in selective responses to new and reference 
scenes, we analyzed the temporal dynamics of when the differences in selectivity first 
appeared. For each cell, we computed the average activity over successive 50-ms bins 
aligned at the scene onset, and we calculated a selectivity index (depth of tuning index, 
or DTI; Moody et al. 1998) for each of the successive 50-ms bins throughout the scene 
and delay periods of the task separately for reference and new scenes. We found that the 
enhanced selectivity was present as soon as the cells responded to the visual stimuli. 
Moreover, the enhanced selectivity was maintained throughout the scene and delay 
periods of the task. 

While many previous studies have shown that neocortical cells in sensory areas can 
shift their sensory tuning with practice, our findings show that hippocampal neurons 
exhibit this same pattern of increasing selectivity to highly familiar stimuli. These 
results are particularly surprising given the previous reports that damage to the hip- 
pocampus does not impair performance on well-learned associations (Murray and 
Wise 1996; Wise and Murray 1999). These physiology studies suggest that, despite 
the fact that other areas may be able to sustain memory for well-learned informa- 
tion, the hippocampus nonetheless participates in the representation of well-learned 
information. These findings are consistent with a role in retrieval of well-learned in- 
formation suggested by functional imaging studies in humans (Schacter and Wagner 
1999; Schacter et al. 1995, 1996; Nyberg et al. 1996a,b). 
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Fig. 6. (A) Distribution 
of selectivity index for 
all selectively responding 
hippocampal cells shown 
separately for new and 
reference scenes. The 
average SIs for the ref- 
erence scene and new 
scenes were 0.55 and 
0.47, respectively. These 
values are significantly 
different from each other. 
(B) Response of a single 
selective hippocampal cell 
to four reference scenes 
and three new scenes. 
This cell responded highly 
selectively to a particular 
reference scene. The SI 
to the references scenes 
was 0.7 whereas the cor- 
responding value for the 
new scenes was 0.3 


The findings summarized above demonstrate that hippocampal activity is involved in 
two key stages in the “life” of an associative memory. First, the hippocampus provides 
a prominent and highly selective signal for the encoding of new associations. A detailed 
analysis of the temporal relationship of these changing neural signals relative to behav- 
ior suggests that hippocampal neurons can signal learning before, at the same time as, 
and after learning. This finding suggests a role both in driving learning behavior and 
in a strengthening process. We also showed that these plastic processes are reflected in 
a shift in the stimulus-selective response properties of the hippocampal cells. The same 
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Fig. 7. Average selectivity values using a depth of tuning index (DTI; Moody et al., 1998) plotted 
for successive 50-ms bin intervals through the scene and delay periods of the task. Asterisks 
indicate significant differences. Dark line on the X-axis shows the time during which the stimulus 
was shown 


population of sampled hippocampal cells also signals very well-learned and extensively 
practiced associations with a significantly more selective response compared to novel 
scenes. These results suggest the possibility that the plasticity underlying the changing 
activity during learning may also underlie the increased selectivity seen in response 
to well-learned reference scenes. Thus, with extensive daily experience, the striking 
learning-related plasticity observed during initial learning may eventually evolve into 
the enhanced selectivity to highly familiar reference scenes. This latter form of selective 
activity may play a role in the representation of well-learned information as well as in 
the retrieval process. Future studies will be needed to better define the time course of 
these effects and the role of the hippocampus in the strengthening or consolidation 
process that occurs between new learning and long-term memory representation. 
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Entorhinal Grid Cells and the Representation of Space 


Francesca Sargolini! and Edvard I. Moser” 


Summary 


The ability to find one’s way depends on the brain’s ability to integrate information 
about location, direction and distance. The algorithms responsible for this integration 
are implemented in a large brain network that includes both hippocampal and parahip- 
pocampal cortices, as indicated by the existence of place cells in the hippocampus and 
head-direction cells in the dorsal presubiculum and a number of other regions. Recent 
results have pointed to the medial entorhinal cortex (MEC) as a possible site for the 
dynamic representation of position in animals that move through a two-dimensional 
environment. Layer II of the MEC contains position-sensitive neurons - grid cells - 
whose firing fields form a periodic triangular pattern that tiles the entire environment 
covered by the animal during exploration of an open surface. Grid cells are observed 
in all principal layers of MEC, but intermingle with head direction cells in layers III, 
V and VI. The two cell types form a continuous population in which a subset of the 
neurons, predominantly in layers III and V, have conjunctive grid and head-direction 
properties. The majority of the cells are modulated by velocity. These observations sug- 
gest that, despite the differential hippocampal and neocortical connections of different 
layers of the MEC, the area operates as an integrated unit, with significant interlaminar 
interaction between cells with different functional properties. As the animal moves 
across a surface, activity may be translated across the superficial sheet of grid cells by 
convergence of position, direction and velocity information in a neural subpopulation 
with conjunctive firing properties. 


Introduction 


More than three decades of research have pointed to the hippocampus as a key region 
of the brain’s neural map of the spatial environment (O’Keefe and Nadel 1978). An 
important milestone was reached when O’Keefe and Dostrovsky (1971) discovered 
that cells in the hippocampus have spatial receptive fields (place fields), discharging 
only when the animal was at particular locations in the environment. A number of 
studies showed that spatial memory was impaired after damage to the hippocampus 
(Olton et al. 1978; Morris et al. 1982; Nadel 1991), and the hippocampus was found to 
be activated in normal human subjects as they searched for hidden goals in a spatial 
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environment (Maguire et al. 1997, 1998). The animal’s position in the external world 
was shown to be represented in the collective activity of hippocampal neurons; from the 
simultaneous firing of a cluster of co-localized neurons, it became possible to predict 
with considerable accuracy the animal’s location as it moved through an open field 
(Wilson and McNaughton 1993). As the years passed, however, accumulating evidence 
suggested that the functions of place cells extended beyond a specific function in 
mapping of the physical space, confirming the impression of clinical neuroscientists 
that hippocampal lesions led to impairment in a much wider spectrum of associative 
memory functions (Scoville and Milner 1957; Squire et al. 2004). Place cells were shown 
to respond to a variety of non-spatial sensory inputs (Wood et al. 1999, 2000; Frank et al. 
2000; Hollup et al. 2001) and to alternate between multiple representations in the same 
location, reflecting both salient physical properties of the place and events associated 
with the place (Bostock et al. 1991; Markus et al. 1995; Kentros et al. 1998; Lever et al. 
2002; Leutgeb et al. 2005a,b). Based on these and a number of other observations, most 
researchers have converged towards the view that the hippocampus has a broad role 
in the encoding of context-specific or episodic memories, in which spatial location is 
a critical but non-exclusive part of what is stored in this region (Leutgeb et al. 2005c). 

One of the strongest pieces of evidence for a spatial map in the hippocampus was the 
apparent contrast between the low spatial information of entorhinal neurons and the 
high spatial information of place cells downstream in the hippocampus (Barnes et al. 
1990; Quirk et al. 1992; Frank et al. 2000), which suggested that the location-specific 
signal was computed internally in the hippocampal circuitry, between the input neurons 
of the superficial layers of the entorhinal cortex and the place cells of the hippocampus 
proper. Several studies during the last few years have refuted this view and instead 
suggested that the animal’s current position is computed upstream of the hippocampus 
in the MEC (Fig. 1), at the intersection between directional input from the presubiculum 
and visual and movement-related inputs from the postrhinal and retrosplenial cortices 
(Witter and Amaral 2004). With the aid of small microelectrodes, we have recorded 
neural activity simultaneously from a number of individually separable neurons in MEC 
in rats that explore spatial environments of different sizes and shapes. Many principal 
neurons in layers II and III of MEC were found to have multiple, sharply delineated 
firing fields that collectively signalled the rat’s current position as accurately as place 
cells in the hippocampus (Fyhn et al. 2004). The sharpest firing fields were observed 
dorsocaudally in the MEC, in the dorsolateral band, which contains the majority 
of the projection neurons to the well-defined place cells of the dorsal hippocampus 
(Witter et al. 1989; Dolorfo and Amaral 1998). Firing fields became gradually wider 
and more dispersed along a dorsolateral-to-ventromedial axis through the MEC. In the 
intermediate-to-ventromedial bands, where all of the previous recordings had been 
made (Quirk et al. 1992; Frank et al. 2000), firing fields were diffuse and covered almost 
the entire recording environment (Fyhn et al. 2004). No spatial structure could be 
observed at the ventral end of MEC. Lesion studies confirmed the importance of the 
dorsolateral band for navigation in spatial laboratory tasks (Steffenach et al. 2005). 
Localized firing in MEC neurons was maintained without a functional hippocampus, 
and similar spatially modulated signals were not observed in the postrhinal cortex, from 
which most of the visuospatial afferent input originates (Witter et al. 1989; Burwell 2000; 
Witter and Amaral 2004). These findings all suggest that information about current 
position is, to a large extent, computed intrinsically in the entorhinal cortex. 
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Fig. 1. Entorhinal cortex of the rat. (A) Ventral posterior view of the whole rat brain. POR, 
postrhinal cortex; PER, perirhinal cortex; MEC, medial entorhinal cortex; LEC, lateral entorhinal 
cortex. (B) Sagittal Nissl-stained section of the rat brain. Red dot indicates a recording position 
at the dorsocaudal end of MEC. Red lines indicate the dorsal and ventral borders of the MEC. (A) 
is reproduced from Sargolini et al. (2006) 


The observation that entorhinal neurons have spatial receptive fields pointed to 
an entirely new function for the entorhinal cortex and suggested that spatial signals 
in the hippocampus are derived from afferent areas rather than being computed in 
the structure itself. The hippocampus was thereby “released” from an exclusive spatial 
domain, and a primary role in the encoding and storage of associative memory, both 
spatial and non-spatial, seemed more likely. 


Grid cells 


The key element of the neural map in MEC is the “grid cell”, a cell type that is 
different from any other functionally defined cell category in the nervous system. 
In our early recordings, we noticed that the majority of the cells in this brain area 
were active exclusively when the rat was at certain locations in the environment, and 
that these locations formed a regular pattern with a non-random spacing (Fyhn et al. 
2004). In subsequent work, using recording enclosures that were large enough to 
capture the spatial structure of the firing fields of the entorhinal neurons, we observed 
that the activity peaks formed a highly periodic pattern (Hafting et al. 2005). For 
each cell, the set of active locations defined a regular array, or a grid, covering the 
entirety of the animal’s environment, like the cross-points of graph paper rolled out 
over the surface of the test arena (Fig. 2A). The repeating unit, however, was not 
a square, like on the graph paper, but an equilateral triangle or, more precisely, a pair 
of oppositely oriented equilateral triangles. Grids of different cells were offset relative 
to each other (Fig. 2B), such that each position in the environment could be identified 
from the activity of a fairly limited number of adjacent cells. Using the collective 
activity of a small set of grid cells, we were able to show that it was possible to predict 
the animal’s current location as it moved through space with a precision of a few 
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Fig. 2. Firing fields of grid cells in MEC. (A) A representative grid cell in layer II. From left to 
right: trajectory of the rat (black) with superimposed spike locations (red); color-coded rate map 
with peak rate indicated; and spatial autocorrelation of the rate map. The color scales are from 
blue to red (minimum to maximum rate, or autocorrelation from r = —1 to r = 1). (B) Grid 
fields of neighboring cells have different spatial phases, i.e., their grids are offset relative to each 
other. Top row: Combined maps of three simultaneously recorded grid cells (t1c1, t2c1, and t2c2; 
each shown in a different color). The left map shows the locations of all spikes on the animal’s 
trajectory; the middle map shows the peak firing locations of each subfield; in the right map, the 
peaks of the three maps are offset to show the similarity in spacing and orientation. Bottom row: 
Spatial cross-correlation of the same three cells. Note that the peaks of the cross-correlograms 
are offset from the origin, indicating a phase shift in the spatial maps. Reproduced from Hafting 
et al. (2005) 


centimetres (Fyhn et al. 2004). These observations point to the network of entorhinal 
grid cells as a possible neuronal coordinate system used by mammals during spatial 
navigation. 
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A key property of the newly discovered entorhinal spatial map is its apparently uni- 
versal activation (Hafting et al. 2005). Unlike place cells in the hippocampus, the grid 
fields of entorhinal neurons are activated in a stereotypic manner across environments, 
regardless of the environment’s particular landmarks, suggesting that the same neural 
map is applied wherever the animal is walking. Grids persist in darkness and appear 
almost instantaneously as an animal is introduced into a novel environment. Yet they 
are also controlled by extrinsic cues, as grid fields follow the landmarks when the walls 
of an environment are rotated. These observations suggest that, while landmarks are 
important for aligning grid representations with the particular landmarks of an envi- 
ronment, keeping representations stable from one occasion to the next, the dynamic 
computation of position is apparently based on the animal’s own movement, using 
distance and direction of movement as the primary inputs, without reference to the ex- 
ternal environment. The grid-cell map is thus likely to be part of the brain’s mechanism 
for path integration (Mittelstaedt and Mittelstaedt 1980). Because of its grid-like na- 
ture, the map can potentially represent places not visited just as well as places that have 
been visited. The grid map is potentially infinite, allowing the inclusion of all places 
and potential places. The existence of a single neural map of the environment that can 
be applied anywhere is economical and avoids the enormous capacity problems that 
might arise if the brain were to store rules about spatial interrelations for every single 
spatial context that the subject encountered. 

A striking feature of the entorhinal map is its regular organization, with grids 
of neighboring cells having a common spacing (distance between activity spots) and 
a common orientation (direction of the three axes). The spacing of the grid increases 
progressively as one gets deeper into the brain. The combination of grids at vari- 
able scales, within the entorhinal cortex or downstream in the hippocampus, provides 
an economical, high-resolution spatial coordinate system for navigation over a large 
space. If grid cells had a single, common scale, the population code would repeat itself 
at intervals corresponding to a single period of the grid, and the place code would 
be ambiguous. If activity is integrated across grids with different spacings and orien- 
tations, however, the cycle for repetition might be very large, enabling each position 
within this radius to be expressed by a unique pattern of collective activity in the grid 
network. In contrast to the topographic organization of spacing and orientation, the 
phase of the grid - the relative displacement of grid vertices in different cells - is ran- 
domly distributed. These organizational features are all mirrored downstream in the 
hippocampus, in which neighboring cells have different firing locations and properties 
(O’Keefe and Nadel 1978; Hirase et al. 2001; Redish et al. 2001), and the scale of the 
place fields increases along the septal-to-temporal axis of the brain area (Jung et al. 
1994; Maurer et al. 2005; Kjelstrup et al. 2006). 


Head-direction cells and conjunctive cells 


The regular structure of the grid field and its relative independence of landmarks in 
the environment implicate the grid cell as part of a universal, path-integration-based 
spatial metric (Hafting et al. 2005; Fuhs and Touretzky 2006; McNaughton et al. 2006). 
A role for grid cells in path integration is also consistent with the inability of animals 
with entorhinal cortex lesions to calculate a return path to their home cage on the basis 
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of self-motion cues (Parron and Save 2004). How the position vector is computed is 
not understood, however. To explore how grid cells interact with cells in other layers 
of the MEC, we recorded from each principal cell layer of MEC in rats that explored 
two-dimensional environments (Sargolini et al. 2006). Grid cells were found in all 
principal cell layers (Fig. 3A), but the proportion of cells with grid properties was 
layer-dependent. In layer II, most well-separated cells exhibited periodic firing, as in 
our earlier observations (Hafting et al. 2005). In the deeper layers, a subset of the 
neurons had clear grid-like firing fields, but the proportion of grid cells was smaller. 


pos 3Hz 
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Fig. 3. Representation of position and direction in MEC neurons. From left to right: trajectory 
map with individual spike locations, color-coded rate map, autocorrelation matrix, and polar 
plot indicating firing rate as a function of head direction. Peak rate is indicated. Symbols for 
trajectory maps, rate maps and autocorrelograms are as in Fig. 2. (A) Representative grid cell 
from layer II. (B) Representative head-direction cell, recorded from layer V. (C) Representative 
grid x head-direction cell, recorded from layer III. Reproduced from Sargolini et al. (2006) 
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The scale of the grid increased from the dorsocaudal to the ventromedial end of the 
MEC, like in layer II, such that cells near the postrhinal border had the densest spacing 
and the smallest firing fields. 

In all layers except layer II, grid cells intermingled with head-direction cells whose 
properties were similar to those of the head-direction cells recorded in dorsal pre- 
subiculum and anterior thalamus in previous studies (Taube et al. 1990a,b; Knierim et 
al. 1995). The firing rate of entorhinal head-direction cells increased from a very low 
background rate to peak rate of up to 40 Hz or more whenever the rat’s head turned to 
a certain range of directions. Most head-direction cells had a sharp directional tuning 
that was similar over the entire apparatus and stable across trials (Fig. 3B). Their peak 
firing directions were widely distributed, and when several head-direction cells were 
recorded simultaneously, their preferred directions were usually uncorrelated (Fig. 4). 
Together with the distributed spatial phase of co-localized grid cells (Fig. 2B), and the 
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co-localization of grid cells and head-direction cells (Fig. 5), the distributed organiza- 
tion of directional preferences may enable the representation of all possible positions, 
distances and directions within small areas of the MEC. 

Grid cells and head-direction cells were found to form overlapping populations, 
where grid cells express variable degrees of directional modulation and head-direction 
cells express variable degrees of grid-like firing structure. Cells with dual firing prop- 
erties were referred to as conjunctive cells (Fig. 3C). All conjunctive cells were also 
modulated by the animal’s instantaneous speed of movement, enabling them to com- 
bine information about the origin of a trajectory with information about how far and 
in what direction the animal has moved at any given time. The presence of cells with 
conjunctive properties points to a possible mechanism for translation of the position 
representation in the grid-cell layer (Samsonovich and McNaughton 1997; Sargolini 
et al. 2006; McNaughton et al. 2006). One important clue is that conjunctive cells are 
located predominantly in layers III and V, where principal cells have extensive den- 
dritic trees extending up to the pial surface (Lingenhohl and Finch 1991; Hamam et al. 
2000; van Haeften et al. 2003; Kloosterman et al. 2003), which might enable them to 
be activated in a location-specific manner by grid cells in layer II. In addition, these 
principal neurons have extensive axonal projections back to the superficial layers, in- 
cluding, probably, the grid cells (van Haeften et al. 2003; Kloosterman et al. 2003). If 
these ascending projections are asymmetric, contacting cells with a slightly different 
spatial phase, the activation of conjunctive neurons by a specific combination of grid 
cells and directional input cells may result in translation of the position representation 
in the population of grid cells, i.e., a shift of activity to cells with a different spatial 
phase, in a manner consistent with the rat’s motion. The direction of the translation 
may be determined by the head-direction input; the distance may be controlled by the 
speed modulation. 

Cells with conjunctive grid x head-direction properties were encountered in all 
principal layers except for layer II. Together with the strong interlaminar connec- 
tions of the entorhinal microcircuit, this finding implies that, despite the differential 
hippocampal and neocortical connections of superficial and deep layers of the MEC 
(Witter and Amaral 2004), the layers operate as an integrated unit, with significant 
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Fig. 5. Co-localization of grid cells, head-direction cells, and grid x head-direction cells in 
the MEC. Distribution of the number of cells in each category as a function of the recording 
depth (distance from the postrhinal border) in three representative rats (one rat for each layer). 
Reproduced from Sargolini et al. (2006) 
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interaction between grid cells, present in all principal cell layers, and head-direction 
cells and conjunctive cells, present in layers III to VI. 


Conclusions 


The discovery of grid cells has pointed to a possible neural basis for spatial representa- 
tion and spatial memory in the brain. Thirty years of almost exclusive focus on spatial 
representation in the hippocampus has been broadened to include a wider cortical 
network, with the entorhinal cortex and associated parahippocampal areas as major 
contributors. As a prominent hub of this network, the medial entorhinal cortex is well 
suited to integrate information about distance, direction and position into an envi- 
ronmentally invariant, self-motion-based dynamic representation of current position 
during spatial navigation in mammals. 
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The Prefrontal Cortex: Categories, Concepts, 
and Cognitive Control 


Earl K. Miller 


Summary 


What controls your thoughts? How do you know how to act while dining in a restaurant? 
This is cognitive control, the ability to organize thought and action around goals. Results 
from our laboratory have shown that neurons in the prefrontal cortex and related brain 
areas have properties commensurate with a role in “executive” brain function. Perhaps 
most importantly, they transmit acquired knowledge. Here, I discuss how the prefrontal 
cortex and basal ganglia may help obtain our internal representations of rules and 
principles needed for goal-directed behavior, thereby providing the foundation for the 
complex behavior of primates, in whom the prefrontal cortex is most elaborate. 


Introduction 


Virtually all animals are capable of reacting to their immediate environment. Food is 
approached, predators avoided, etc. Many of these reactions are automatic processes 
and reflexive. If something is bearing down on us, we leap out of the way before we have 
even had to consciously form the thought. Automatic processes thus seem to depend 
on relatively straightforward, hardwired, relationships between the brain’s input and 
output systems. In neural terms, it seems that they depend on strong, well-established, 
neural pathways just waiting to be triggered by the right sensory cue. That is, automatic 
processes are driven in a “bottom-up” fashion: they are determined largely by the nature 
of the sensory stimuli and whatever reaction they are most strongly wired to. But more 
advanced animals, such as primates, can also be proactive and goal-directed. We have 
the capacity to predict unseen goals and put together plans to achieve those that are 
within our reach. We can suppress reflexive reactions to the environment and urges in 
order to willfully direct behavior toward anticipated and desired future states. This is 
called cognitive control. 

Learning and memory are at the essence of both classes of behavior. Automatic 
habits seem to be established by repeated co-activation of neural pathways, as if “ruts” 
were being carved into the brain. These habits can be simply triggered and fired off in 
a “ballistic” fashion with little variation; hence they require little internal oversight. In 
contrast, for truly sophisticated goal-directed behavior, simply recording and replaying 
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experiences is not sufficient. Goal-relevant relationships need to be sorted out from 
spurious coincidences and, importantly, long-term goals require more than figuring 
out the world piecemeal, one situation at a time. Smart animals get the “big picture” 
of the jigsaw puzzle of their experiences, the common structure across a wide range 
of experiences. These generalized, abstracted representations offer an efficient way to 
deal with a complex world. They allow the navigation of many different situations with 
a minimal amount of storage and also allow us to deal with novelty. By extracting 
the essential elements from our experiences, we can generalize to future situations 
that share some elements but may, on the surface, appear very different. For example, 
consider the concept “camera”. We do not have to learn anew about every camera we 
may encounter. Just knowing that the item is a camera communicates a great deal of 
knowledge about its parts, functions, and operations. 

How and where such knowledge is acquired by the brain are key questions for 
understanding cognitive control. One cortical region, the prefrontal cortex (PFC), 
seems to play a central role. It is the brain area that reaches its greatest elaboration 
in the human brain and is thus thought to be the neural instantiation of the mental 
qualities that we think of as “intelligent”. Here, I will discuss evidence for the role 
of the PFC in, and theories of, cognitive control, with an emphasis on how the PFC 
and related brain areas might acquire and represent the higher-order knowledge and 
abstract principles needed to orchestrate sophisticated goal-directed behavior. 


The Prefrontal Cortex 


The prefrontal cortex (PFC), the cortex at the very front of the brain, is an ideal place to 
look for neural correlates of abstract information. It occupies a far greater proportion 
of the human cerebral cortex than in other animals, suggesting that it might contribute 
to those cognitive capacities that separate humans from animals (Fuster 1995). At 
first glance, PFC damage has remarkably little overt effect; patients can perceive and 
move, there is little impairment in their memory, and they can appear remarkably 
normal in casual conversation. Despite the superficial appearance of normality, PFC 
damage seems to devastate a person’s life. They have difficulty in sustaining attention 
and keeping “on task”, and they seem to act on whims and impulses without regard 
to future consequences. This pattern of high-level deficits coupled with a sparing of 
lower-level, basic, functions has been called a “dysexecutive syndrome” (Baddeley and 
Della Sala 1996) and “goal neglect” (Duncan et al. 1996). 

Indeed, the anatomy of the PFC suggests that it is well suited for a role as the brain’s 
“executive”. It can synthesize information from a wide range of brain systems and exert 
control over behavior (Nauta 1971; Barbas and Pandya 1991). The collection of cortical 
areas that comprise the PFC has interconnections with brain areas processing external 
information (with all sensory systems and with cortical and subcortical motor system 
structures) as well as internal information (limbic and midbrain structures involved in 
affect, memory, and reward). Correspondingly, its neurons are highly multimodal and 
encode many different types of information from all stages of the perception-action 
cycle (Fuster 1995). They are activated by stimuli from all sensory modalities, before 
and during a variety of actions, during memory for past events, and in anticipation 
of expected events and behavioral consequences, and they are modulated by internal 
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factors such as motivational and attentional state (for review, see Miller and Cohen 
2001). Because of its highly multimodal nature and its apparent role in higher mental 
life, the PFC seemed like an ideal place to begin our search for neural correlates of the 
abstract information needed for intelligent behavior. 


Neural Correlates of Categories and Concepts 


Categorical representations do not faithfully track exact sensory input. They provide 
useful groupings and divisions not present in the external world and thus are the 
building blocks of the high-level knowledge needed for sophisticated, goal-directed 
behavior. Consider a simple example. Crickets sharply divide a range of pure tones into 
“mate” versus “bat” (a predator; Wyttenbach et al. 1996). Even though the input varies 
along a continuum, behavior does not. At the low end of the range, crickets approach 
the sound equally, but then at 16 kHz, their behavior suddenly flips to avoidance and 
remains equivalent across another wide range. This behavior allows the crickets to 
maximize reproduction while minimizing disaster. Another example is humans’ per- 
ception of the facial expressions of emotion (Beale and Keil 1995), which also flips at 
a discrete point. Thus, the representation of perceptual categories must involve some- 
thing beyond the sort of neural tuning that encodes physical attributes, i.e., gradual 
changes in neural activity as attributes gradually change (e.g., shape, orientation, direc- 
tion). Perceptual categories instead have sharp boundaries (not gradual transitions), 
and members of the same category are treated as similar even though their physical ap- 
pearance may vary widely. This is the most fundamental form of categorization. More 
conceptual categories can have “fuzzy” boundaries and are likely to rely on different 
types of representation. 


Shape-based categories 


By learning to take multiple dimensions into account to make sharp distinctions and 
groupings, advanced animals such as primates can acquire relatively complex percep- 
tual categories, such as “animal” (Roberts and Mazmanian 1988), “food” (Fabre-Thorpe 
et al. 1998), and “fish” (Vogels 1999a). The search for neural correlates of such high- 
level, shape-based categories has naturally focused on brain regions at the final stages 
of visual form processing, the inferior temporal cortex (ITC: Desimone et al. 1984; 
Logothetis and Sheinberg 1996; Tanaka 1996) and the PFC, which receives highly pro- 
cessed visual information and orchestrates voluntary, goal-directed behaviors (Fuster 
1995; Miller and Cohen 2001). 

The ITC has long been known to contain neurons with a high degree of selectivity 
for complex objects. Neurons with tuning properties suggestive of categories have been 
identified since the seminal work of Gross and colleagues (Desimone et al. 1984). They 
described a small population of “face cells” that were strongly activated by the gestalt 
of a face but not by individual features or other stimuli. These neurons tend to be 
highly localized and clustered in both the monkey and human temporal cortex, as 
are neurons selectively activated by pictures of familiar places (Kanwisher et al. 1997; 
Epstein and Kanwisher 1998; Tsao et al. 2006). Vogels (1999b) reported a small number 
of inferior temporal neurons in trained monkeys that were specifically activated by 
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“trees” or “fish” and showed relatively little differentiation between diverse examples 
of those categories. Kreiman et al. (2000) recorded from the hippocampus, amygdala, 
and adjacent cortex of human epileptic patients and found a few neurons that were 
highly selective for diverse pictures of familiar concepts like “Bill Clinton”. 

This use of natural images has identified neurons that ultimately contribute to cate- 
gorical representations, but unless neurons are tested for the identifying characteristics 
of perceptual categories (sharp boundaries, relative equivalence within a category), it 
is possible that their activity reflects physical similarity rather than category member- 
ship per se; “trees”, after all, look more like one another than like other stimuli. This 
is not to say that neurons with such complex properties are not critical contributors 
to categories. The question is whether they are the only contributors. Are category 
representations the sum of neurons sensitive to the physical appearance of its members 
or are there neurons that favor information about category groupings over informa- 
tion about the individuals and explicitly represent category per se? That is, are there 
neurons whose activity mirrors the across-category distinctions and within-category 
generalizations seen on the behavioral level? 

To test for neural correlates of perceptual categories, we trained monkeys to cate- 
gorize computer-generated stimuli into two categories, “cats” and “dogs” (Freedman 
et al. 2001; Fig. 1). A novel 3D morphing system was used to create a large set of 
parametric blends of six prototype images (three species of cats and three breeds of 
dogs; Beymer and Poggio 1996; Shelton 2000). By blending different amounts of cat 
and dog, we could smoothly vary shape and precisely define the boundary between 
the categories (> 50% of a given type). As a result, stimuli that were close to but on 
opposite sides of the boundary were physically similar, whereas stimuli that belonged 
to the same category could be physically dissimilar (e.g. the “cheetah” and “housecat”). 

Nearly one third of responsive neurons were category-selective in that they exhib- 
ited an overall difference in activity to cats versus dogs. Similar numbers gave stronger 
activity to cats as dogs. Figure 2 shows a single neuron that exhibited the hallmarks of 
perceptual categorization. It exhibited greater activity to dogs than cats and responded 
similarly to samples from the same category regardless of their degree of dogness or 
catness. This explicit reflection of the cat and dog categories in single neuron activity, 
just like lower-level attributes such as shape or color, may account for the ability of 
primates to rapidly and effortlessly categorize the world around us. 

As our monkeys had no experience with cats or dogs prior to training, it seemed 
likely that these effects resulted from training. To test this, we defined two new category 
boundaries that were orthogonal to the original boundary (see Fig. 1; Freedman et al. 
2001, 2002). Following training, we found that the original categories were no longer 
reflected in activity; instead, the three new categories were. A category index revealed 
no significant effect of the two old, now-irrelevant categories, either across the pop- 
ulation of stimulus-selective neurons or across all recorded neurons. However, when 
the category index was computed using the three new (relevant) category boundaries, 
a significant category effect was observed across these populations, indicating that 
neural activity shifted to reflect the newly learned categories. 

The next question concerned the contribution of visual cortex, namely the ITC. The 
ITC has long been thought to be the final stage in a visual cortex pathway needed for 
object recognition. Damage to it causes deficits in visual discrimination and learning 
(Gross 1973; Mishkin 1982) and category-specific agnosias (e.g., for faces) in humans 
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Fig. 1. “Cat” and “dog” prototypes and morph lines. Stimulus space was divided by one cate- 
gory boundary for cats vs dogs and by two boundaries that created three arbitrary categories. 
Thousands of unique morphs were created from the prototypes. Red lines are between-category 
and blue lines within-category morph lines for the cats and dogs. Examples of stimuli along the 
C1-D1 morph line are shown above 


(Gainotti 2000). It might be that the category information in the PFC was retrieved from 
long-term storage in the ITC for its immediate use on the task. Interactions between 
the PFC and ITC underlie the storage and/or recall of visual memories and associations 
(Rainer et al. 1999; Tomita et al. 1999). 

To determine whether PFC category effects were simply “inherited” from the ITC, 
we recorded from the ITC during performance of our category-matching task. This 
experiment yielded a striking difference. There was virtually no category effect across 
the ITC population and no examples of neurons whose activity showed the sharp 
across-distinction/within-category generalization that is the behavioral signature of 
categorization. Rather, ITC neurons were sensitive to the physical appearance of the 
individual stimuli; its neurons did not “throw away” information about individuals 
in favor of the category groupings, like PFC neurons did. This finding suggests that 
learned visual categories are abstracted at the level of the PFC, not in visual cortex. 
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Fig. 2. The shape category effect on the behavioral level (top) and in a single neuron. Note 
that both show a sharp transition at the category boundary. The six colored lines show the 
neuron’s average activity to stimuli at different morph levels. The colored tiles show activity to 
individual stimuli along the nine morph lines that crossed the category boundary. The bottom 
shows recording sites where shape category neurons were found 


But how are these categories abstracted? Some theories posit that we build up some 
form of average prototypes of each category, others a “list” of defining features. Indeed, 
we found in separate psychophysical experiments that each monkey learned a different 
combination of features to categorize the stimuli (Freedman et al. 2002). Insight into this 
process comes from a study by Sigala and Logothetis (2002). They found that stimulus 
features relevant for category judgments were enhanced in inferior temporal activity 
relative to irrelevant features. This finding suggests a process that weights features 
according to their relevance for category membership, supporting more feature-based 
models of categorization. Further evidence comes from Baker et al. (2002), who recently 
showed that training monkeys that certain features of complex objects “go together” 
results in ITC neural selectivity for those feature combinations. These studies all suggest 
that, while the ITC may emphasize and knit together behaviorally relevant features, the 
PFC is where this information converges to produce explicit representation of learned 
categories that largely discards information about individual category members. 


Numerosity 


The shape category effects in the lateral PFC naturally raised the question of whether 
it is specialized for this type of categorization or whether its role in categorization 
is broader. Small number judgments are markedly different from shape categories. 
Behavioral experiments and modelling work suggest that small number estimations 
do not rely on sharp boundaries like shape categories but overlapping filters tuned to 
different numbers (Dehaene and Changeux 1993; Dehaene et al. 1999). At the same 
time, numerosity is very abstract; “two” can refer to a pair of anything. Finally and 
importantly, unlike the learned cat and dog shape categories, small number estimation 
is innate; many animals can make number judgments without any training (Brannon 
and Terrace 1998; Brannon et al. 2001), as can preverbal human infants (Hauser and 
Hauser 2000; Xu and Spelke 2000; Hauser et al. 2002). Of course, animals cannot derive 
numerical information by verbally and serially “counting” items (like humans); rather 
they encode numerosities in a non-verbal, analog magnitude format (as do infants: 
Xu and Spelke 2000). Such approximate representations of numerical values may be 
regarded as biological precursors of adult humans’ counting abilities. 

Human imaging studies indicate that the frontal cortex and the posterior pari- 
etal cortex (Dehaene et al. 1999) are involved in numerical judgments. So, studies in 
monkeys have focused on those regions. Sawamura et al. (2002) trained monkeys to 
alternate between five arm movements of one type and five of another. They found neu- 
rons in a somatosensory-responsive region of the superior parietal lobule (SPL) that 
kept track of the movement number. By contrast, they found that relatively few such 
neurons were found in the lateral PFC, where we found our shape category neurons. 
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We trained monkeys to judge the number of items (1-5) in visual displays (Nieder 
et al. 2002). We trained monkeys to generalize by having them judge the number of 
items across a large number of very different looking displays (Fig. 3). We found ample 
number-tuned neurons (again, about 1/3 of the population) in the same portion of the 
lateral PFC where we had observed the shape category neurons (Fig. 4). One possibility 
for the difference between our study and that of Sawamura et al. may be modality 
(touch vs. vision), but another may be the level of abstraction. Most movement-number 
representations found by Sawamura and colleagues (85%) were not abstract: number- 
selective activity depended on whether the monkey’s movement was “push” or “turn”. 
By contrast, the visual number representations we found in the lateral PFC were abstract 
and generalized; activity was unaffected by the physical appearance of the displays. 

We also found numerosity representations in posterior cortex: there was a small 
cluster of visual numerosity neurons in the intraparietal sulcus, as predicted from 
human imaging studies. However, numerosity neurons were far fewer in number than 
what was seen in the lateral PFC (and there were no numerosity effects in the infe- 
rior temporal cortex; Nieder and Miller 2004). This finding suggests a fronto-parietal 
network for numerosity in monkeys and establishes homologies between the monkey 
and human brain. Further, it indicates that training results in large fractions of PFC 
neurons conveying category information even when the category judgments involve 
special, innate abstractions like small numbers. It may be that innate categories have 
explicit representation in posterior cortical areas like numerosity in the parietal cortex 
(or faces in the inferior temporal cortex), but then large populations of PFC neurons 
become tuned to them as a result of training. By contrast, if there are no innate rep- 
resentations, the categories are abstracted de novo in the PFC (like our cat and dog 
categories). 


Rules 


Meaningful groupings of information do not only occur in the sensory domain. We can 
lump together sets of events and actions into general guidelines, principles, or rules 
for behavior. Consider restaurants. We have knowledge of generic rules. such as “wait 
to be seated”, “order”, and “pay the bill”, that are long decoupled from the specific 
experiences in which they were learned. We then have a notion of what is expected of us 
the first time we walk into a new restaurant. As learning and applying abstracted rules 
and principles are central to complex, goal-directed behavior, a number of investigators 
have recently argued that their acquisition and representation may be a critical PFC 
function and their lack could explain the “goal neglect” that follows prefrontal damage 
(Houk and Wise 1995; Fuster 2000; Fuster et al. 2000; Miller and Cohen 2001; Wood 
et al. 2005). Neural correlates of rules like shape categories and numerosity have been 
shown to be abundant in the PFC of both monkeys and rodents (Asaad et al. 1998, 
2000; Hoshi et al. 1998; White and Wise 1999; Wallis et al. 2000; Schoenbaum and 
Setlow 2001), suggesting that the visual categories described above are a sensory-based 
manifestation of a general prefrontal function in abstracting information across many 
domains. 

When rules involve familiar cues and responses (such as “stop at red”), they are 
“concrete” and can be represented as a set of specific Stimulus-Response (S-R) associ- 
ations. With varied experience, rules can be abstracted beyond such specificity. Wallis 
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Fig. 3. Number-matching task. Monkeys were trained to match the number of items in a dis- 
play across randomly generated changes in item size and position. After training, the monkeys 
and neurons generalized across disparate displays that controlled for changes in low-level at- 
tributes 
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Fig. 4. Recording sites in the lateral PFC for the numerosity experiment, and examples of neurons 
that are tuned for number and generalize across changes in the physical appearance of the 
displays. The recording sites show the percentage of neurons at a given recording site that 
exhibited numerosity tuning. We found these neurons in virtually the same regions of the lateral 
PFC where we found the cat and dog neurons 


et al. trained monkeys to do just that (Wallis et al. 2001; Wallis and Miller 2003; Muham- 
mad et al. 2006). Monkeys alternated between applying either a “same” or “different” 
rule to pairs of pictures. After training with a wide range of pictures, they could apply 
these rules successfully upon seeing a novel picture for the first time (and therefore 
had no prior S-R association). This finding means the monkeys had learned to apply 
generalized principles. About 40% of neurons in the PFC reflected which rule was cur- 
rently being used (Fig. 5). This neural activity was also generalized: it was independent 
of which specific cue signalled the rule, not linked to the behavioral response, and 
unaffected by which pictures the monkeys were judging. PFC engagement by abstract 
rules has also been reported in human imaging studies (Bunge et al. 2003). 

We also tested the inferior temporal cortex because the ITC is presumably involved 
in visual analysis of the pictures used in the task (and, as mentioned above, is directly 
connected with the PFC). In sharp contrast, there was virtually no abstract rule infor- 
mation in the ITC (Muhammad et al. 2006). This finding parallels our work on shape 
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Fig. 5. Illustration of the firing rate of a single PFC neuron across time when the monkey is 
remembering one of four different pictures, and which of two abstract rules it should follow: 
“same” or “different”. The sample picture is shown to the monkey for 800 ms. There is then a 1500- 
ms delay before the test picture appears. At this point, the monkey must decide whether the test 
picture is the same as the sample picture or different. Typically, the firing rate of PFC neurons does 
not encode the identity of the picture but encodes which abstract rule the monkey was following. 
The neuron illustrated here showed higher activity when the monkey was remembering the 
“same” rule compared to the “different” rule. The four histograms show the neuron’s activity 
when the monkey was holding four different pictures in memory. Note that rule activity is the 
same regardless of which picture was remembered 


categories and thus adds further support to the notion that the learning of high-level, 
goal-relevant abstraction is an important frontal cortex function. 


Slow versus fast learning and abstraction 


At first blush, it might seem that the greatest benefit would come from learning that 
proceeds as quickly as possible. Fast learning has obvious advantages: one can learn to 
get to resources and avoid obstacles faster and better than competitors. But fast learning 
comes at a cost; it does not allow the benefits that come from generalizing over multiple 
experiences, so by necessity it tends to be specific. Extending learning across multiple 
episodes allows organisms to pick up on the regularities of predictive relationships 
and allows detection of common structure across many different experiences, the 
regularities and commonalities that form abstractions, general principles, concepts, 
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and symbolisms that are the medium of the sophisticated thought and the “big pictures” 
needed for truly long-term goals. Indeed, this is key to proactive thought and action. 
Generalizing among many past experiences naturally endows the ability to generalize 
to the future, to imagine possibilities that we have not yet experienced, but would like 
to. So, how does the brain balance the obvious pressure to learn as quickly as possible 
with the advantages of slower learning in forming abstractions? Some recent evidence 
suggests that this process may take place in interactions between the PFC and the basal 
ganglia. 

The basal ganglia (BG) is a collection of subcortical nuclei. Cortical inputs ar- 
rive largely via the striatum (which includes both the caudate and the putamen), are 
processed through the globus pallidus (GP), the subthalamic nucleus (STN), and the 
substantia nigra (SN), and are then directed back into the cortex via the thalamus. The 
frontal cortex receives the largest portion of BG outputs, suggesting some form of close 
collaboration between them (Middleton and Strick 1994, 2000, 2002). 

We recently compared activity in the lateral PFC and BG (specifically the caudate 
nucleus, which receives direct inputs from the PFC) as monkeys learned to associate 
a visual cue and a directional eye movement (Pasupathy and Miller 2005). Over the 
few tens of trials it took for monkeys to learn these associations, neural activity in the 
striatum showed rapid, almost bi-stable, changes compared to a much slower trend in 
the PFC. Interestingly, however, the slower, more executive PFC seemed to be the final 
arbiter of behavior; the monkeys’ improvement in selecting the correct response more 
closely matched PFC changes. 

This finding fits with observations of a striatal infrastructure ideal for rapid, su- 
pervised (reward-based) learning (Wilson and Kawaguchi 1996; Reynolds et al. 2001). 
It also supports hypotheses that learning in the frontal cortex is “trained” by the BG 
(Houk and Wise 1995; O’Reilly and Munakata 2000). Dopaminergic reward-prediction 
error signals from the midbrain (Schultz and Dickinson 2000; McClure et al. 2003) 
may allow rapid formation of reward-relevant associations in the striatum (Bar-Gad 
et al. 2003), which then trains slower, more graded, Hebbian mechanisms in the PFC 
via output nucleii of the BG and thalamus (O’Reilly and Munakata 2000; O’Reilly and 
Frank 2006). If so, this may mean that the BG is more specialized for the rapid learning 
of specific information needed to produce a desirable outcome. By contrast, the slower 
changes in the PFC may have allowed it to accumulate more evidence and more slowly 
and judiciously arrive at the correct answer, as well as detect the generalities across 
more experiences needed to form abstractions. 

The faster changes and stronger effects of the specific cue-response associations 
in the striatum of the BG than in the PFC seem consistent with this hypothesis. But 
if it is true then, by contrast, we might expect to find that abstract rules and general 
principles are more strongly encoded in the PFC. A recent experiment by Muhammad 
et al. showed just that (2006). Building on the work of Wallis et al. (2001), we trained 
monkeys to apply the abstract rules “same” and “different” to pairs of pictures. If the 
“same” rule was in effect, monkeys responded if the pictures were identical, whereas 
if the “different” rule was in effect, monkeys responded if the pictures were different. 
The rules were abstract since the monkeys were able to apply the rules to novel stimuli 
for which there could be no pre-existing S-R association. This is the definition of an 
abstract rule. Muhammad et al. recorded neural activity from the same PFC and striatal 
regions as Pasupathy and Miller (2005) and found that, in contrast to the specific-cue 
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response associations, the abstract rules were reflected more strongly in PFC activity 
(more neurons with effects and larger effects) than in BG activity, the opposite of what 
Pasupathy and Miller (2005) reported for the specific cue-response associations. 

Finally, another important aspect of the connections between the frontal cortex 
and BG is that they form “closed” anatomical loops. Anatomical tracing techniques 
have suggested that functionally similar cortical areas project into the same striosome 
(Yeterian and Van Hoesen 1978; Van Hoesen et al. 1981; Flaherty and Graybiel 1991). 
For example, both sensory and motor areas relating to the arm seem to preferentially 
innervate the same striosome. The BG maintains a degree of topographical separation 
in different “channels” throughout its nuclei, ensuring that the output via the thalamus 
is largely to the same cortical areas that gave rise to the initial inputs to the BG (Selemon 
and Goldman-Rakic 1985; Alexander et al. 1986; Parthasarathy et al. 1992; Hoover and 
Strick 1993; Kelly and Strick 2004). 

This structure suggests a recursive system where the results from one iteration 
can be fed back through the loop for further processing. Such a system is ideal for 
bootstrapping: repeated iterations can link in more and more information, building 
itself into ever-increasing elaboration and sophistication. This process may allow the 
bootstrapping of neural representations to increasing complexity and, with the slower 
learning in the PFC, greater and greater abstractions. It may also underlie a hallmark 
of human intelligence: it is easiest for us to understand new concepts if they can 
be grounded in familiar ones. We learn to multiply through serial addition and we 
understand quantum mechanics by constructing analogies to waves and particles. 
Interactions between the BG and the prefrontal cortex may support this type of cognitive 
bootstrapping. As more complex and generalized representations are learned in the 
prefrontal cortex, they are passed down through the BG for further expansion. 


The prefrontal cortex and BG: goal-directed learning 
and cognitive control 


Here is the general idea. Goal-directed thought and actions are learned via dopamin- 
ergic-gated plasticity in frontal cortex-BG loops. Fast learning mechanisms in the 
BG (specifically the striatum) are more specialized for the detection and storage of 
specific experiences that can and do lead to reward (i.e., activation of the midbrain 
dopamine signals). The output of the BG trains slower learning mechanisms in the 
frontal cortex. The slower cortical learning is not only less error-prone, but it also 
allows the frontal cortex to build up abstract, generalized, representations that reflect 
the regularities across many different experiences. Recursive iterations of these loops 
allow bootstrapping of ever-more complex, and ever-more predictive, rules and greater 
abstractions. 

In this view, the more-primitive BG is the “engine” driving goal-directed learning. 
The frontal cortex, and the PFC more specifically, is an “add on”; it evolved to add 
greater sophistication to the learning. Both the PFC and BG can drive behavior (they 
both send projections to cortical and subcortical motor system structures). However, 
the PFC may have greater leverage because it has widespread cortical connections 
and thus can influence cortical processing at many levels. By virtue of its feedback 
projections to sensory cortex, for example, the PFC can play a direct role in filtering 
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out potentially distracting sensory information (i.e., attention). Thus, the PFC and BG 
may work together as different parts of a frontal lobe system for goal-directed learning, 
but because the PFC has the big picture and can exert a top-down influence on much 
of the cortex, it may be the executive component of this system. 

It is important to note that this review was not meant to be an exhaustive survey of 
PFC and BG function, neither was it meant to imply that these are the only functions of 
these structures. For example, work by Kesner and colleagues suggest that the BG may 
select behavioral responses based, in part, on inputs from the PFC (Kesner and Rogers 
2004). Models by Bullock and Grossberg suggest that the BG has a gating function, 
allowing or disallowing expression of certain neural representations in the PFC (and 
other brain areas; Brown et al. 2004). Graybiel and colleagues emphasized the role of 
the BG in the formation of habits (Graybiel 1998). We also point the reader to the 
work of Robbins, Arnsten, and colleagues on the role of ascending neurotransmitter 
systems in, and interactions between, the PFC and BG in mediating cognitive flexibility 
(Arnsten and Robbins 2002; Robbins 2005). None of these findings is inconsistent with 
what we propose here. This review is only one slice through the multivariate functions 
of these structures, one that emphasizes their role in acquiring the knowledge needed 
for sophisticated goal-directed behavior. 

But how is this acquired knowledge used to coordinate the brain-wide processing 
needed for cognitive control? Miller and Cohen (2001) suggested that the cardinal 
PFC function is to acquire and actively maintain patterns of activity that represent 
goals and the means to achieve them (rules) and the cortical pathways needed to 
perform the task (“maps”, hence “rulemaps”; Fig. 6). Under this model, activation 
of a PFC rulemap sets up bias signals that propagate throughout much of the rest 
of the cortex, affecting sensory systems as well as systems responsible for response 
execution, memory retrieval, emotional evaluation, etc. The aggregate effect is to guide 
the flow of neural activity along pathways that establish the proper mappings between 


Fig.6. In the Miller and Cohen model, a task model is formed in the PFC when reward signals link 
together neurons activated by the events that lead to reward. A subset of cues can then activate the 
entire representation. Bias signals resulting from the maintenance of this pattern guide the flow of 
activity along neural pathways that establish the task-relevant mappings between representations 
of inputs (C1, C2, C3), internal states, and outputs (R1, R2) in posterior cortex. Thick lines and 
filled circles denote active processing lines 
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inputs, internal states, and outputs to best perform the task. Establishing the proper 
mapping is especially important whenever stimuli are ambiguous (i.e., they activate 
more than one input representation), or when multiple responses are possible and the 
task-appropriate response must compete with stronger, more habitual, alternatives. 
In short, the task information is acquired by the PFC-BG loops, which when actively 
represented in the PFC, provides support to related information in posterior brain 
systems, effectively acting as a global attentional controller, or traffic cop, if you will. 
This endows the hallmarks of cognitive control. It provides goal-direction because there 
is an explicit representation of goals and means. It allows the brain to override habits 
by the PFC providing top-down signals that bias the flow of activity in the cortex away 
from strongly established (habit) pathways and toward weaker ones that match the 
PFC goal-related representation. Finally, it provides flexibility because, if you need to 
change your goal or strategy, you do not have to rewire your whole brain. If PFC activity 
patterns dynamically establish neural pathways throughout the cortex, then changing 
your mind is as easy and fast as changing the PFC activity pattern. Change the PFC 
pattern and the rest of the cortex falls in line. 
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Summary 


Individuals with early Alzheimer’s disease (AD) suffer from a selective and profound 
failure to form new memories. A novel molecular mechanism with implications for 
therapeutics and diagnostics is now emerging in which the specificity of AD for mem- 
ory derives from disruption of plasticity at synapses targeted by neurologically active 
AB oligomers. We have named these oligomers “ADDLs” (for pathogenic AB-derived 
diffusible ligands). ADDLs constitute metastable alternatives to the disease-defining 
AP fibrils deposited in amyloid plaques. In AD brain, ADDLs accumulate primarily 
as AB 12-mers (~54 kDa). The same size oligomers occur in tg-mouse AD models; 
in mice, these 12-mers appear concomitantly with memory failure, consistent with 
the ability of ADDLs to inhibit long-term potentiation (LTP) and block reversal of 
long-term depression (LTD). Mechanistically, ADDLs are gain-of-function ligands that 
bind with specificity to particular synapses, targeting synaptic spines. Binding leads 
to a rapid and ectopic expression of the memory-linked immediate early gene Arc. 
Such aberrant accumulation has been linked by others to memory dysfunction in tg- 
Arc mouse models. Consistent with the expected consequences of Arc overexpression, 
ADDLs promote loss of surface NMDA receptors and anomalous spine morphology, 
which are responses expected to contribute to plasticity failure and memory dysfunc- 
tion. Importantly, the attack on synapses provides a putative mechanism that unifies 
AD memory dysfunction with major features of AD neuropathology. Recent findings 
show ADDL binding instigates synapse loss, AD-type tau hyperphosphorylation, and 
generation of reactive oxygen species (ROS). Binding sites for ADDLs are at or in the 
close vicinity of NMDA receptors. Antibodies against external domains of NMDA re- 
ceptors reduce ADDL binding and inhibit ADDL-stimulated ROS formation. The ROS 
response also is inhibited by memantine, an open-channel blocker of NMDA receptors 
recently approved for AD therapeutics. The ability of memantine to contravene the im- 
pact of ADDLs offers a new mechanism to explain why an NMDA receptor antagonist 
should improve memory function in AD patients. Elimination of ADDLs by vaccines 
now under development could provide the first AD treatments that are truly disease- 
modifying. In addition to establishing a molecular mechanism of significant value for 
AD therapeutics and diagnostics, studies of ADDL interactions with synaptic pathways 
and control mechanisms ultimately may provide new insights into the extraordinary 
complexities of physiological synaptic information storage. 


1 Cognitive Neurology & Alzheimer’s Disease Center, Department of Neurobiology and Physiol- 
ogy Northwestern University Institute for Neuroscience, 2205 Tech Drive, Evanston, IL 60208, 
USA, wklein@northwestern.edu 


Bontempi et al. 
Memories: Molecules and Circuits 
© Springer-Verlag Berlin Heidelberg 2007 


156 W.L. Klein et al. 


Individuals with early Alzheimer’s disease (AD) specifically suffer from a profound 
failure to form new memories. As the disease progresses, retrieval of stored memories 
also fails, and the dementia grows increasingly broad and catastrophic. A sense of 
what this means for people with the disease comes from a story involving Auguste D, 
the subject in the first case study of AD 100 years ago (Alzheimer 1906). Auguste D 
was asked in a visit by Dr. Alzheimer to write her name. Unable to do so, she said 
instead, “I have lost my self...”. Fortunately, in this centennial year of Alzheimer’s 
seminal study, the means to block this profound loss of self may not be far away. 
We can begin to envision disease-modifying therapeutics that might stop progression 
to end-stage disease, perhaps even restoring memory function to normalcy in early- 
stage disease. While these prospects appear optimistic, they stem from increasingly 
instructive insights into the molecular mechanisms of AD pathogenesis. 

As we attempt to define disease mechanisms, there are two critical benchmarks 
that must be satisfied. Ultimately, the correct theory must explain why early AD targets 
memory and it must account for AD’s pathological hallmarks. Satisfying these bench- 
marks is challenging, and at times it has been suggested that AD could be multiple 
diseases. Nonetheless, as reviewed below, evidence is accumulating to indicate that 
memory loss and neuropathology might be attributable to the toxic impact of soluble 
oligomers of AB, the same 42-amino acid peptide that generates the fibrils found in 
amyloid plaques. Much smaller than the large insoluble fibrils, the oligomers are just 
a little larger than 50 kDa in mass, i.e., similar to an average-sized globular protein. 
As reviewed below, Af oligomers attack synapses with great specificity, disrupting 
synapse shape, composition and the mechanisms of plasticity. Newer findings show 
that oligomers also trigger neuronal damage characteristic of the major features of AD 
neuropathology: synapse loss, tau hyperphosphorylation, oxidative stress, and ulti- 
mately nerve cell death. Oligomers thus seem to be appealing candidates for the hidden 
toxins of AD memory loss and neuropathology. 


The original amyloid cascade hypothesis: fibrils and nerve cell death 


Early findings from nerve cell biology, human genetics and neuropathology provided 
an exceptionally strong foundation for the concept that AB-derived fibrillar toxins 
play a primary role in AD pathogenesis. This hypothesis, summarized in 1992 in 
a seminal review by Hardy and Higgins, has been extraordinary in its ability to generate 
experimental progress. The benchmark evidence for the amyloid cascade (coupled with 
more than 15,000 citations to AB in PubMed) indeed seems compelling: 


AD is linked by human genetics to the AB peptide. 

Af in vitro self-assembles into lethal neurotoxins. 

toxic AB preparations contain abundant, readily-detectable fibrils. 
fibrils formed in vitro mimic those found in AD plaques. 

AD is accompanied by significant nerve cell death. 


These findings generated a simple and persuasive concept - AD is, in essence, the 
consequence of neuron death induced by insoluble deposits of large amyloid fibrils. The 
original amyloid cascade has been a powerful hypothesis, but despite its compelling 
support, it misses the mark in explaining memory loss in early AD. Critical flaws are 


ADDLs - Molecules that Disrupt Memories in Alzheimer’s Disease 157 


epitomized by findings in two transgenic mouse vaccine studies published in 2002. 
These studies (Dodart et al. 2002; Kotilinek et al. 2002) tested the therapeutic impact of 
monoclonal antibodies against AB. Consistent with the amyloid cascade hypothesis, the 
mouse models expressing the human amyloid precursor protein (APP) manifested both 
age-onset plaques and memory failure. However, the vaccination produced results that 
were strikingly inconsistent with the hypothesis. First of all, memory loss was found 
to be reversible. One study showed reversal after about two weeks (Kotilinek et al. 
2002), the other in as little as 24 hours (Dodart et al. 2002). Secondly, despite reversing 
memory loss, the AB antibodies did not reduce amyloid plaque burden. These models 
of early AD thus manifest an AB-dependent memory loss that is not the consequence of 
neuron death and is not instigated by the presence of amyloid plaques. In fact, the lack 
of correlation between plaque burden and dementia in humans has long been noted 
by neuropathologists (Katzman et al. 1988), who have considered this a major flaw in 
the original amyloid cascade hypothesis. 


The alternative to fibrillar toxins: neurological damage 
by soluble Af oligomers 


If plaques do not initiate the disease, how does one explain the consequences of mutated 
APP or the therapeutic efficacy of anti-AB monoclonals? Newer findings indicate that 
the crucial molecules are not found in senile plaques and that the neurological impact of 
these molecules significantly precedes neurodegeneration. While retaining the central 
role for AB-derived toxins, these findings strongly support a new concept: early memory 
loss comes from failure of synaptic plasticity, not neuron death, and the fundamental 
molecular pathogens are small soluble Af oligomers, not fibrils. In other words, fibrils 
are not the only toxins derived from Af, and probably not the most important ones. How 
these concepts emerged and underlie new opportunities for early-stage diagnostics and 
disease-modifying therapeutics is reviewed in the remainder of this chapter. 

An early clue came from experiments by Finch and colleagues (Oda et al. 1994, 
1995) on the interaction of AB with ApoJ (clusterin), an upregulated, plaque-associated 
molecule in AD brain. Rather than promoting plaque formation, ApoJ proved surpris- 
ingly effective at blocking AB from forming large aggregates. Inhibition of aggregation 
occurred even at substoichiometric doses of ApoJ (1 part ApoJ, 20 parts AB), suggesting 
a chaperone-like action. Toxicology experiments with the PC12 pheochromocytoma 
cell line provided an even bigger surprise. ApoJ-AB solutions, which lacked sedi- 
mentable aggregates of AB, remained highly effective at blocking MTT reduction, an 
indicator of vital mitochondrial function and vesicle trafficking. The Apo] experiments 
provided the first indication that AB-containing structures other than large insoluble 
fibrils might be pathogenic. 


A different outcome for Af self-assembly: globular molecules rather 
than fibrillar aggregates 


The nature of the AB assemblies produced in the presence of ApoJ was first investigated 
by our group (Lambert et al. 1998), leading to the discovery that toxin-producing AB 
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self-assembly does not lead inexorably to large fibrillar aggregates or even protofibrils. 
An alternative outcome comprises small globular molecules only several nanometers in 
diameter. Atomic force microscopy (AFM) images of ApoJ-Aß solutions ruled out the 
presence of fibrils or short rod-like protofibrils, showing only small structures roughly 
comparable in dimensions to soluble globular proteins smaller than 100 kDa. Unlike 
the readily observed fibrils, such nanoscale structures would be cryptic to conventional 
neuropathology and easily missed in electron microscopy. 

Alternative methods without ApoJ have been developed to generate fibril-free, 
globular AB assemblies. In our original approach (Lambert et al. 2001; Klein 2002), 
AB42 was monomerized by HFIP, dissolved in fresh DMSO, diluted into cold F12 culture 
medium, and briefly centrifuged. The supernatants were completely free of protofibrils 
(Chromy et al. 2003). Even after 24 hours at 37 degrees, micromolar solutions exhibited 
no rod-like protofibril structures, establishing that the globular molecules seen in AFM 
were at least metastable. Most importantly, these ApoJ-free solutions were neurotoxic, 
proving that the active molecules were homo-oligomers of AB, not ApoJ- AB complexes. 


Globular toxins are built of SDS-resistant oligomers 


The composition of the globular neurotoxins has been analyzed by gel chromatog- 
raphy and electrophoresis (Lambert et al. 2001; Chromy et al. 2003). SDS-resistant 
AB42 oligomers typically range from dimer to 24-mer. AB40, which physiologically 
is much more abundant than AB42, does not generate stable oligomers (Levine 1995, 
2004). The spectrum of oligomers resolved by electrophoresis is much more evident by 
Western blot than silver stain, although Western blot patterns are markedly antibody- 
dependent. The commonly used 4G8 monoclonal, for example, is unusually sensitive 
to dimers while poorly recognizing mid- to large-sized oligomers (Lambert et al. 2001; 
Chromy et al. 2003). Other experimental conditions also influence the analysis. For 
example, some oligomers are stable in SDS at room temperature, but they break down 
when subjected to boiling, a step often used in SDS-PAGE. Although it has been sug- 
gested that SDS-PAGE itself might generate small oligomers, in fact monomers sans 
oligomers are readily detected by SDS-PAGE using fresh preparations (Chromy et al. 
2003). Oligomerization occurs quickly, however, even at low concentration. Oligomers 
form from 10 nM solutions of monomer within minutes (Chang et al. 2003), and the 
process of immunoblotting will generate oligomers from concentrated monomers if 
SDS is not constantly present. 

Not all oligomeric states are equally abundant, and oligomer assembly itself is 
extremely sensitive to in vitro conditions, presumably reflecting the conformational 
dynamics of AB. Smaller SDS-resistant species (3- and 4-mers) are favored in solu- 
tions kept at 4 degrees, but stable 12-mers become prominent when dilute solutions of 
tetramers are incubated at 37 degrees (Klein 2002). Conformation differences, more- 
over, exist between oligomers of the same size. For example, SDS-stable oligomers have 
been observed that are not toxic and are unreactive with antibodies that bind toxic 
species (Chromy et al. 2003). Physiologically relevant factors also have been identi- 
fied that affect oligomerization, including the oligomer-promoting action of divalent 
cations (Huang et al. 2004) and levuglandin (Boutaud et al. 2005). Overall, however, 
the mechanism of oligomer formation, the stability of oligomers, and the relationship 
between oligomerization and fibrillogenesis still remain poorly characterized. 
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Neurological impact: oligomers rapidly block synaptic information storage 


Metastable AB ogligomers have generated major interest because of their remark- 
able impact on mechanisms germane to memory loss. Memory formation begins at 
synapses, so one might anticipate that destruction of memory formation in AD itself 
begins at synapses. A crucial question, therefore, has been whether oligomers might 
compromise synaptic plasticity. The answer has been clear-cut. In LTP and LTD, the 
classic paradigms for learning and memory, Af oligomers are neurologically potent 
CNS toxins that rapidly disrupt synaptic information storage. 

Initial experiments (Lambert et al. 1998; Klein 2001) tested stereotaxic injections 
of ApoJ-derived oligomers on LTP in living mice. Although neither ApoJ alone nor 
amyloid fibrils had any impact, the injected oligomers produced a rapid and profound 
inhibition. Inhibition was confirmed and extended in brain slice experiments in which 
oligomers prepared without ApoJ completely block LTP at sub-micromolar concen- 
trations (Lambert et al. 1998). Effects occurred within an hour and were selective for 
particular aspects of plasticity. While oligomers inhibit LTP, they do not inhibit LTD 
(Wang et al. 2002). On the other hand, oligomers do block the reversal of LTD. The net 
neurological effect of oligomers in the hippocampus thus is to repress positive synaptic 
feedback. To account for these altered states of plasticity, it has been hypothesized that 
oligomers disrupt glutamate receptor trafficking (Gong et al. 2003; Lacor et al. 2004a), 
which is essential for both LTP maintenance and LTD reversal (Sheng and Lee 2001). 
Recent evidence supports this possibility (Lacor et al. 2004b, 2005; Snyder et al. 2005). 
A mechanism to account for disrupted trafficking is discussed at the end of the article. 

The initial impact of oligomers on synaptic plasticity is rapid and selective. There 
is no effect, for example, on evoked action potentials, so the mechanism is non- 
degenerative, at least at first. These findings led to a prediction that was somewhat 
iconoclastic: if oligomers were responsible for memory loss in early AD through an im- 
pact on synaptic plasticity, then early memory loss should be reversible (Lambert et al. 
1998; Klein 2001). As discussed earlier, this prediction was verified in the impressive 
vaccination experiments on mouse models (Dodart et al. 2002; Kotilinek et al. 2002). 


The amyloid cascade hypothesis revisited: memory loss in early AD 
is a synaptic disease caused by soluble Af oligomers 


LTP is not memory but is widely recognized as a good experimental paradigm for the 
study of memory mechanisms. The rapid impact of oligomers on LTP thus is intuitively 
appealing in its relevance to AD, and in 1998 we proposed a new hypothesis that at- 
tributed early memory loss to oligomer-induced failure in synaptic plasticity (Lambert 
et al. 1998). The experimental foundation for this concept has been substantiated in 
multiple investigations (Chen et al. 2000; Vitolo et al. 2002; Klyubin et al. 2004, 2005; 
Wang et al. 2004a; Costello et al. 2005; Nomura et al. 2005; Puzzo et al. 2005; Trommer 
et al. 2005; Walsh et al. 2005), with especially strong support found in a major study 
of LTP by Walsh, Selkoe and colleagues (Walsh et al. 2002). That group showed that 
oligomers found in medium conditioned by hAPP-transfected CHO cells are excep- 
tionally potent inhibitors of LTP in vivo. Western blots indicate that the cell-derived 
oligomers comprise mostly small SDS-stable oligomers, free of structures that might 
be considered protofibrils, although given antibody selectivity and gel conditions it 
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is unclear whether mid-sized oligomers might also be present (e.g., 9-24-mers). As 
predicted, controls using insulin-degrading enzyme, which proteolyzes monomers but 
not oligomers, ruled out the possibility that monomeric Af contributed to inhibition, 
whereas drugs that blocked AB production also blocked accumulation of the neuro- 
logically active molecules. 

The compelling verification that oligomers are neurologically significant and suffi- 
ciently stable to accumulate in a cell-conditioned media, along with the vaccine results 
validating the prediction that memory loss is reversible, provided impetus for signif- 
icant modifications in the amyloid cascade hypothesis (Hardy and Selkoe 2002). The 
new cascade includes two significant emendations: 1) early memory loss now is at- 
tributed to synapse failure, not neuron death, and 2) synapse failure is attributed to AB 
oligomers, not amyloid fibrils. 

The hypothesis that AB oligomers play a role in AD pathogenesis has stimulated 
interest in the broader possibility that toxic protein oligomers may be common to 
multiple diseases (Klein 2005). Af is one of more than 20 different fibrillogenic proteins 
that are disease-linked and, until recently, it had been assumed that the pathogenic 
molecules were fibrillar. However, a number of these proteins now have been found 
to generate sub-fibrillar, oligomeric cytotoxins (Conway et al. 2000; Butler et al. 2003; 
Reixach et al. 2004). In some cases, contrary to earlier dogma, oligomers exhibit 
cytotoxicity but fibrils do not. Extrapolation from AB as a specific case study thus is 
providing general insight into diseases of protein folding and mis-assembly. Strategies 
that target elimination of soluble oligomeric toxins could provide new therapeutics for 
a broad range of significant diseases. 


Clinical substantiation that AD pathology includes neurologically 
active AB oligomers 


If soluble AB oligomers are to be valid targets for AD therapeutics, they must be 
present in human brain and manifest a strong AD-dependent accumulation. All the 
evidence reviewed so far comes from experimental models and oligomeric toxins 
generated in vitro. The critical question is whether AD-affected brains contain identical 
neurologically active AB oligomers. 

Early evidence indicated that pre-fibrillar assemblies indeed existed in AD-affected 
brains (Frackowiak et al. 1994; Kuo et al. 1996). Since fibrils were regarded as the initia- 
tors of pathogenesis, it was assumed the oligomers simply were surrogates for ongoing 
fibrillogenesis. The oligomers were considered transient intermediates en route to for- 
mation of the pathogenically relevant amyloid fibrils. However, given the potential of 
the oligomer hypothesis to account for AD memory loss, significant efforts recently 
have sought to substantiate the AD-dependent accumulation of brain oligomers. 

Because not all oligomers are toxic (Chromy et al. 2003), it has been important to 
determine whether brain-derived oligomers are structurally equivalent to the neuro- 
logically disruptive, laboratory-derived oligomers. The key tools for rigorous analysis 
have been conformation-dependent antibodies. Such antibodies are readily generated 
by oligomer-based vaccines (Lambert et al. 2001), which are highly immunogenic 
and superior to short peptides for generating antibodies recognizing conformational 
epitopes. Polyclonal antibodies have been obtained that are at least 1000 times more 
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sensitive for oligomers than monomers (Lambert et al. 2001; Chang et al. 2003) and 
are capable of detecting less than 0.1 fmole oligomerized AB in dot immunoblots. 
Monoclonal antibodies have been obtained with similar properties (Lambert et al. 
2003, Lambert et al. 2007), whereas some show unique attributes. One interesting 
monoclonal, for example, binds oligomers but not fibrils (Chromy et al. 2003), whereas 
another binds fibrils but not oligomers (Lambert et al. 2007), suggesting that fibrils are 
not simple assemblies of oligomeric subunits. A second approach to antibody genera- 
tion by Glabe and colleagues has been to use oligomers coupled to gold particles (Kayed 
et al. 2003). These immunogens have generated oligomer-selective antibodies with an 
intriguing ability to recognize oligomers of multiple fibrillogenic proteins besides AB 
(e.g., alpha synuclein and IAPP); it has been proposed, therefore, that particular struc- 
tural domains may be common to toxic protein oligomers in general (Kayed et al. 2003). 
The generic oligomer antibodies do not bind fibrils, consistent with the concept that 
structures of non-fibrillar and fibrillar toxins are not related in a simple manner. 


Oligomers occur in human brain and are strikingly elevated in AD 


Assays based on oligomer-selective antibodies have provided compelling evidence that 
oligomers are bona fide components of AD pathology: oligomers occur in human brain 
and are strikingly elevated in AD; oligomers from AD brain are structurally homolo- 
gous to synthetic oligomers; and hAPP mouse models of AD accumulate oligomers as 
predicted. Oligomers found in CSF, moreover, appear to provide a significant biomarker 
of AD that is potentially of value for future clinical diagnostics. 

To assess soluble oligomers, brain extracts have been prepared using physiological 
buffers to avoid chemical perturbation. Soluble fractions, obtained after ultracentrifu- 
gation and analyzed by dot immunoblots, show a remarkable, AD-dependent increase 
in oligomers. AD subjects show levels up to 70-fold higher than those found in controls 
(Gong et al. 2003), verified with three different oligomer-selective antibodies (Kayed 
et al. 2003; Lacor et al. 2004a). The large increase in extractable oligomers is regionally 
selective, robust in frontal cortex but marginally detectable in cerebellum (Lacor et al. 
2004a; Fig. 1), a pattern consistent with other aspects of AD pathology. 

With respect to structure, three types of data indicate that soluble oligomers in brain 
extracts are equivalent to the toxic oligomers prepared in vitro. First, brain oligomers re- 
act with conformation-dependent antibodies raised against synthetic oligomers. These 
conformation-dependent antibodies not only distinguish oligomers from monomers 
but also distinguish toxic oligomers from non-toxic species; neither capacity is typical 
of antibodies generated against short peptides (Lambert et al. 2001). Second, synthetic 
and AD brain-derived preparations show equivalence by two-dimensional gel analysis. 
Each contains a prominent oligomer with isoelectric point of ~ 5.6 and SDS-stable 
mass of ~ 54 kDa, i.e., 12-mers (Gong et al. 2003). Additional species have been ob- 
tained by extracting brain with SDS, indicating some oligomers are trapped or tightly 
bound. Third, in cell culture experiments discussed below, both synthetic and soluble 
AD brain-derived oligomers show attachment to nerve cell surfaces in identical, ligand- 
like patterns (Gong et al. 2003; Lacor et al. 2004a). No binding occurs with extracts from 
control brain. Overall, the data indicate that neurologically active oligomers formed in 
vitro are structural homologs of oligomers that accumulate in AD brain. 
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Fig. 1. AB oligomers are deposited extracellularly around neurons and are highly elevated in 
Alzheimer’s-affected cortex. Clinical substantiation of ADDL involvement in AD pathology has 
come from immunomicroscopy and immunochemistry. Cortical sections from AD brain treated 
with ADDL-generated antibodies show diffuse synaptic-type labeling surrounding individual 
pyramidal neurons (fluorescent- or peroxidase-conjugated secondary antibodies, (A) and (B), 
respectively). Similar staining is evident in MCI (mild cognitive impairment). Immunodot-blots 
of soluble brain extracts confirm the AD-dependent accumulation of ADDLs (C). ADDL pathology 
is pronounced in cortex but not cerebellum. (Reprinted from Lacor et al. 2004a) 


TG mouse models of AD accumulate soluble oligomers 


Animal models of AD are essential for studies of mechanism and drug discovery, so 
it has been important to determine whether they also accumulate oligomers. Early 
evidence suggested indirectly that oligomers should be present. For example, some 
hAPP transgenic mouse strains manifest brain damage (synapse loss, deficiencies in 
LTP, poor memory performance) that does not correlate with plaque burden, in some 
cases despite the absence of amyloid deposits. These animals essentially recapitulate 
the poor correlation between plaques and dementia in humans, but in an extreme man- 
ner. Although originally rejected as inadequate models because they lacked amyloid 
plaques, some of these strains in fact may be excellent models for early AD caused by 
toxic oligomers. 
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In harmony with human data, the occurrence of oligomers has recently been 
confirmed in multiple strains of tg mice. Even when the commonly used 4G8 anti- 
body shows no increase, assays with oligomer-selective antibodies detect increases in 
immunoreactivity that reach several-hundred-fold over controls (Chang et al. 2003). 
These robust increases in oligomers are regionally selective, indicating that broad dif- 
fusion does not occur. In the highly regarded triple transgene model of Oddo et al. 
(2003), which exhibits AD-like tau phosphorylation as well as plaques, two different 
oligomer-selective antibodies showed the same pattern of developmental increase, with 
a transient peak in oligomers at six months and a sustained elevation after 15 months 
(Oddo et al. 2006). 

Quantitative correlation of oligomer levels with memory dysfunction was first ev- 
ident in experiments done in collaboration with Karen Ashe’s group (Westerman et al. 
2002), in which memory loss occurred after oligomer levels increased past certain 
threshold levels, a possibility consistent with the concept of cognitive reserve (Mesu- 
lam 1999). More recently, that group has reported memory loss correlates best with 
the accumulation of SDS-stable 12-mers (Lesne et al. 2006). This important finding 
confirms that memory dysfunction in tg mouse models is linked to the same oligomers 
found by Gong et al. in AD brain. Another mouse model, developed by Ohno et al. 
(2005), also shows highly upregulated soluble oligomers; in these animals, knockout 
of B-secretase eliminates oligomers and blocks deficiencies in a fear-linked memory 
task. 

The abundance of oligomers resulting from transgene expression in these animals 
likely explains why brain deficiencies correlate poorly with amyloid plaques, and it 
substantiates their value as models for early AD pathology and memory loss. 


Human neuropathology: distribution of oligomers in situ is consistent 
with links to AD onset 


Immunohistochemistry with oligomer-selective antibodies readily distinguishes AD 
brain sections from controls (Lambert et al. 2007). Besides confirming the presence of 
oligomers, results reveal an unexpected specificity to oligomer distribution in situ. In 
lightly stained AD brain sections, oligomers show a perineuronal localization (Lacor 
et al. 2004a), indicating the predominant localization is not cytoplasmic (Fig. 1). Al- 
though oligomers accumulate in the cytoplasm of certain tg mouse strains (Takahashi 
et al. 2004), this could be the consequence of very high oligomer production in mice. 
Oligomers also do not co-localize with the fibrils of compact plaques (Kayed et al. 2003). 
Perineuronal distribution likely corresponds to so-called diffuse deposits, a nomen- 
clature that may be misleading with respect to cellular mechanisms. “Diffuse deposit” 
implies a non-selective extracellular precipitate, but the distribution surrounding indi- 
vidual neurons suggests that oligomers may associate with particular sites in dendritic 
arbors. Perineuronal distribution is readily observed for isolated, individual neurons, 
suggesting that the source of oligomers may be that same neuron. 
Perineuronal/dendritic staining around isolated neurons appears to be the first 
sign of pathology in preclinical brain sections (Bigio et al. 2005). This fact, along with 
observations that soluble oligomers in control brain extracts sometimes show minor 
elevations (Gong et al. 2003), indicates that oligomer accumulation begins preclinically. 
The relationship between accumulation and dementia is unlikely to be straightforward, 
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with current results indicating that dementia would manifest only after oligomer levels 
exceed some critical threshold. Recently developed plaque-imaging agents (Klunk et 
al. 2004) probably will be inadequate to detect oligomers at threshold levels because 
oligomer distribution is distinctly segregated from thioflavin-positive plaques (Kayed 
et al. 2003). However, alternative imaging agents that specifically target oligomers are 
under development (Montalto et al. 2004) and might provide a means for early AD 
detection. 


Nanotechnology-based assays of cerebrospinal fluid confirm 
the extracellular presence of oligomers 


The occurrence of oligomers in soluble brain extracts, along with their perineuronal 
distribution, provides indirect evidence that oligomers occur extracellularly and thus 
are potentially accessible for clinical assays. If extracellular localization exists, then 
oligomers should occur in CSF, but neither ELISAs nor dot immuno-blots verify this 
prediction, even using the most sensitive oligomer-generated antibodies. However, 
in a recent and unique approach using nanotechnology, we have succeeded in in- 
creasing sensitivity by orders of magnitude while still retaining immunospecificity 
(Georganopoulou et al. 2005). With this new assay, CSF oligomer levels show marked 
AD dependence, with median levels for AD subjects elevated 10-fold over controls 
(p < 0.0001 for 30 samples). Overlap between populations is virtually non-existent. 
Although a much larger sample is needed to validate these results, and the current as- 
say itself is still being enhanced, the use of oligomers as biomarkers in AD diagnostics 
shows unique promise. 


Cellular mechanism - oligomers specifically target synapses 
and disrupt the molecular cell biology of memory 


Oligomers block synaptic information storage in electrophysiological experiments. 
They trigger AD-type pathology in transgenic models and likely instigate mem- 
ory loss. Furthermore, in human brain, oligomers show a striking accumulation 
around neurons that begins in very early stages of AD. There clearly is a need to 
understand how oligomers act in terms of molecular mechanisms. How is it that 
oligomers attack neurons, and what memory-relevant molecular changes are trig- 
gered? An appealing hypothesis is suggested by the putative association of oligomers 
with dendritic arbors in situ: perhaps oligomers attack and disrupt signaling path- 
ways specifically at sites critical to memory formation, in essence acting as pathogenic 
ligands. 


Patterns in culture recapitulate patterns in vivo: oligomers are “AB-derived 
diffusible ligands” (ADDLs) that bind to hot-spots on dendritic surfaces 


One would predict, if the ligand hypothesis were true, that incubation of cultured neu- 
rons with soluble oligomers from AD brains should generate a perineuronal pattern 
analogous to that observed in brain sections. Experiments with highly differentiated 
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monolayers of rat hippocampal neurons, using oligomer-selective antibodies to local- 
ize distribution, have confirmed this prediction (Gong et al. 2003; Lacor et al. 2004a). 
Oligomers in crude extracts from AD brains show remarkably patterned attachment 
to neuron surfaces, localizing at small hot-spots that are especially rich in mature den- 
dritic arbors. CSF from AD subjects also gives this pattern, although binding is faint, 
consistent with very dilute oligomer levels. Patterns of identical perineuronal hot-spots 
are generated when neurons are incubated with synthetic oligomers. Extracts from con- 
trol brain, however, show no binding. The selective binding of AB oligomers, whether 
from AD brain or prepared in vitro, substantiates their ligand nature. This property, 
coupled with their neurological impact and their solubility, has led to the acronym 
“ADDL” (pronounced “addle”), for a pathogenic “AB-derived diffusible ligand”. 

The pattern of binding to cultured neurons supports the hypothesis that diffuse 
deposits in AD originate with oligomers bound to particular sites in dendritic arbors. 
If oligomers are gain-of-function pathogenic ligands, a key question is how specific is 
their target? In fact, in any given hippocampal culture, oligomers attack only a subset 
of neurons, typically between 25% and 50% of those present. Neurons side-by-side 
in culture can be ADDL-positive and ADDL-negative (Lacor et al. 2004a), even with 
similar morphology and expression of memory-relevant molecules such as CaMKII. 
What discriminates them is not yet known. Binding sites, however, are sensitive to low- 
doses of trypsin (Lambert et al. 1998), indicating the existence of particular proteins 
that serve adventitiously as oligomer receptors. 


A key to memory loss - oligomers (ADDLs) specifically target synapses 


The identity of the hot-spots themselves may be the key to memory loss mechanisms. 
More than 90% of ADDL hot-spots co-localize with puncta of PSD-95 (Lacor et al. 
2004a), a major component of post-synaptic densities that, in mature hippocampal 
cultures, is a reliable marker for post-synaptic terminals (Allison et al. 2000). Colo- 
calization of ADDLs with PSD-95 establishes that ADDLs are ligands that specifically 
target synapses. Roughly 50% of the synapses do not bind ADDLs, so specificity ex- 
tends also to the nature of the synapses. The fact that ADDLs are ligands that directly 
and specifically attack synapses provides substantial strength to the hypothesis that 
memory loss could be an oligomer-induced failure of synaptic plasticity. 

Importantly, synaptic binding is evident for ADDLs obtained from AD brain as 
well as prepared in vitro. Simple fractionation of brain extracts or synthetic ADDLs 
indicates the ligands are the 12-mers identified by two-dimensional gel analysis (Lacor 
et al. 2004a). 


Sub-synaptic targeting: ADDLs bind to synaptic spines and disrupt translation 
of Arc, an immediate early gene essential for long-term memory formation 


Confocal immunofluorescence microscopy shows that the synaptic targeting of ADDLs 
frequently localizes to dendritic spines, as illustrated here by a neuron double-labeled 
for ADDLs and CaMKII (Fig. 2). While other sites also may be targeted, spines are 
particularly interesting because they contain the post-synaptic signal transduction 
components of excitatory synapses. Signaling events within spines orchestrate the 
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Fig. 2. Localization of ADDL binding sites to dendritic spines. ADDLs act as pathogenic gain- 
of-function ligands that target synapses. Here, ADDLs have been added to cultures of highly 
differentiated hippocampal cells (21 DIV), which then were immunolabeled for ADDLs (green) 
and aCaMKII (red). In overlaid images, ADDLs show a highly patterned distribution seen to 
coincide (yellow) with dendritic spines. More than 90% of the ADDL puncta occur at synapses. 
Scale bars: Top, 40 um; Bottom insets, 8 um. Adapted from (Lacor et al. 2004a) 


molecular plasticity essential for initiating memory formation. CaMKII, for example, 
plays a critical role in synaptic information storage and is enriched in synaptic spines. 
An important feature of spines is their dynamic cytoskeleton (Carlisle and Kennedy 
2005). Spines are rich in actin and actin-associated proteins, and this molecular machin- 
ery regulates spine geometry and also the trafficking of glutamate receptors, essentially 
the underpinnings of synaptic plasticity. 

Experiments focusing on the relationship between ADDLs and an actin-associated 
protein in synaptic spines have proven especially salient. This protein is Arc (acronym 
for activity-regulated cytoskelet on protein), the product of an immediate early gene 
required for long-term memory formation (Guzowski et al. 2000)). Arc mRNA traffics 
specifically to spines, where its translation is induced by synaptic transmission in 
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a transient manner (Steward and Worley 2002). Chronic over-expression has been 
predicted to impair long-term memory, essentially by increasing noise in information 
processing (Guzowski 2002) and, in Arc tg mice, chronically high levels of Arc in 
these animals correlate with poor performance in learning tasks (Kelly and Deadwyler 
2003). 

When neurons are exposed to ADDLs, Arc is ectopically induced. After five minutes, 
there is an induction of small Arc puncta that coincides with ADDL-labeled spines 
(Lacor et al. 2004a). This response, however, is not transient. After six hours, Arc 
immunoreactivity remains markedly elevated and spread ectopically throughout the 
dendritic arbor. ADDLs thus provoke chronic Arc overexpression, a specific molecular 
anomaly associated with dysfunctional learning in tg mice. 

We have considered how dysfunctional learning might be caused by particular 
synaptic changes resulting from Arc overexpression. Since Arc is an F-actin asso- 
ciated protein, we predicted that its chronic elevation might cause aberrations in 
glutamate receptor trafficking and spine shape (Lacor et al. 2004a), factors linked 
to F-actin that are expected to be critically associated with synaptic plasticity. We 
previously suggested that receptor trafficking dysfunction might be related to the 
deficiencies in LTP maintenance and LTD reversal caused by ADDLs (Gong et al. 
2003), and Kelly and Deadwyler (2003) have suggested the learning dysfunction 
found in their transgenic Arc mice is due to actin-dependent loss of spine structural 
plasticity. 


Veh3h AD3h Veh3h AD3h 
NMDA-R1 NMDA-R2B 


Fig. 3. NMDA-R subunits NR1 and NR2B levels are reduced after ADDL treatment in rat primary 
hippocampal neurons. Twenty-one-day-old hippocampal neurons were exposed to 500 nM ADDL 
for three hr and fixed while permeabilization steps were excluded to ensure labeling of surface 
receptors. Labeling was done using antibodies raised against the extracellular domain of the 
receptor subunits NR1 and NR2B (ImmuQuest, 1:200 and Santa Cruz, 1:200, respectively). High 
magnification images of dendritic branches were collected by confocal microscopy, and the 
numbers of NMDA-R-immunoreactive puncta were quantified using Metamorph software (Meta 
Image Series, Universal Imaging Corporation). As shown in the representative image, the density 
of NR1- and NR2B-positive puncta (number of puncta per length of dendrites) was significantly 
reduced, by 70%, after ADDL treatment (p < 0.01) 
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Fig. 4. ADDL exposure affects drebrin-labeled dendritic spine morphology and density. Primary 
rat hippocampal neurons (21 days old) were treated with 500 nM ADDLs or with vehicle, fixed, 
and double-labeled as previously published (Lacor et al. 2004) for oligomer (M94, 1:2000) and for 
drebrin (Stressgen, 1:500). Representative images collected using a Leica LCS confocal microscope 
are reconstructions of a stack of z sections captured at 0.5-um intervals of a hippocampal neuron. 
Dendritic spine distribution (dendritic spines are identified as drebrin-positive puncta) after 
5 min, 6 hr and 24 hr of ADDL (A, B and C) or after 24 hr of vehicle treatment (D). Spine deficits 
were detected in 6-hr and 24-hr ADDL-treated neurons compared to 5-min ADDL-treated and 
24-hr vehicle-treated neurons. The right column represents magnified dendritic spines depicting 
mushroom-type or stubby spines (in controls) and dendritic protrusions that are markedly 
longer, reminiscent of dendritic filopodia, and sometimes branched protrusions. After 24 hr, 
severe spine disappearance is observed in ADDL-treated cells 


Predictions validated: ADDLs alter spine composition and shape 


We recently have validated the prediction that ADDLs exert a significant impact on 
both receptor levels and spine shape. Exposing neurons to ADDLs causes a 70% de- 
crease in NR1 and NR2B subunits of NMDA-type glutamate receptors from the plasma 
membrane (Fig. 3; Lacor et al. 2004b, 2005, 2007). These receptors are critical for synap- 
tic plasticity, and their down-regulation likely accounts for glutamate-initiated CREB 
phosphorylation following ADDL exposure (Tong et al. 2001). Interestingly, ADDLs 
also trigger a decrease in EphB receptors, which in spines are associated with NMDA 
receptor signaling. With respect to spine shape, ADDLs cause the short, club-like struc- 
ture of controls to be supplanted by long, spindly spines (Fig. 4). This type of aberrant 
spine geometry is known to exhibit low glutamate receptor levels (Matsuzaki et al. 
2001), consistent with the impact of ADDLs. Most significantly, spines that are aber- 
rantly long and spindly have been linked to mental retardation and prionoses (Fiala 
et al. 2002). Although predicted from the Arc response, the mechanism underlying 
the ADDL-induced shift in spine shape is not yet known. EphB could be involved be- 
cause of its established relation to spine morphogenesis and association with Rho/Rac 
signaling pathways, and, in fact, ADDLs affect Rac activity (Chromy et al. 2003). Addi- 
tionally, the spine shape change induced by ADDLs is accompanied by rearrangements 
in drebrin and spinophilin, which are actin-regulating proteins. While Arc induction 
and its predicted consequences in response to ADDLs are very intriguing, oligomers 
also impact other important signaling pathways germane to synaptic plasticity (Tong 
et al. 2001; Vitolo et al. 2002; Wang et al. 2004a; Puzzo et al. 2005). Overall, a compre- 
hensive understanding of the relationships between ADDLs and signaling remains an 
important goal. 


Coupling of ADDLs to major features of AD neuropathology 


Synapse loss and neuron death 


Studies with transgenic animals have strongly suggested that ADDLs might be respon- 
sible for synapse elimination (Mucke et al. 2000), an aspect of neuropathology that in 
humans affords the best correlate of dementia (Masliah et al. 1993). This possibility has 
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been verified directly in cell biology experiments. After 24 hours of exposure to AD- 
DLs, the spiny post-synaptic specializations manifest a significant decrease in density, 
with an overall loss of spines equaling 50%. Spine loss of this magnitude also has been 
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reported for hAPP transgenic mice (Jacobsen et al. 2006). The original observations of 
synapse loss in transgenic mice correlated the pathology with soluble rather than in- 
soluble forms of AB (Mucke et al. 2000) and implicated a pathogenic role for the protein 
tyrosine kinase Fyn (Chin et al. 2005). Other tg mouse studies have reported decreased 
levels of Arc (Palop et al. 2005), consistent with degeneration of synaptic terminals. 
Interestingly, an early stage of synapse degeneration in AD may include adaptive en- 
largement of pre-synaptic terminals (Mukaetova-Ladinska et al. 2000; Scheff and Price 
2003), a response also observed in hippocampal neurons exposed to ADDLs (Fig. 5). 
After short-term exposure, neurons exhibit bigger boutons as well as longer spines. At 
longer times, when spine levels have decreased 50%, the targeted neurons show a near- 
complete elimination of drebrin, another F-actin binding protein typically localized to 
spines (Fig. 4). Abnormalities in drebrin similarly manifest in hAPP transgenic mice 
(Calon et al. 2004). 

The degeneration of synapses ultimately is superseded by neuronal degeneration 
and death, observed in organotypic slice cultures (Lambert et al. 1998). Neuronal loss 
is a significant characteristic of AD. In culture, knockout of Fyn is neuroprotective, 
indicating the pathogenic role of Fyn is shared in mechanisms responsible for death as 
well as synapse loss. 


Fig. 5. ADDLs induce a shift toward bigger synaptic boutons. Images obtained by confocal 
microscopy of 6-hr ADDL (500 nM) and vehicle-treated hippocampal neurons illustrate that 
ADDLs increase the size of synaptophysin (Sigma, 1:500) immunolabeled puncta throughout 
the neuropil. Total number of pixels/field captured increased significantly after ADDL treat- 
ment (p < 0.02); however, this increase was not accompanied by an increase in the number of 
puncta/field (p > 0.05). The increase in the total number of labeled pixels/field can be explained 
by a significant shift toward larger sized puncta (p < 0.002). The percentage distributions of 
SVP-positive puncta in the different size categories are represented in the three-dimensional pie 
charts obtained from 10 different fields collected per group of treatments. Synaptically bound 
ADDLs might be involved in reactive synaptogenesis 


ADDLs - Molecules that Disrupt Memories in Alzheimer’s Disease 171 
Tau phosphorylation 


In the revised amyloid cascade hypothesis, AB oligomers ultimately instigate formation 
of neurofibrillary tangles (Hardy and Selkoe 2002). Increased brain levels of soluble 
Af directly correlate with neurofibrillary tangle (NFT) density in AD patients (McLean 
et al. 1999) and Af oligomers have been shown to activate glycogen synthase kinase-3B 
(Hoshi et al. 2003), one of the kinases that appears to be involved in pathological tau 
hyperphosphorylation. However, direct demonstration that tau hyperphosphorylation 
could indeed be induced by Af oligomers has been lacking. Our recent results show that 
both synthetic and AD brain-derived ADDLs instigate marked increases in neuronal 
levels of AD-type tau phosphorylation, establishing that ADDLs are also coupled to 
this major facet of AD pathology (Fig. 6). 


non-AD extract 


Fig. 6. ADDLs induce AD-type tau hyperphosphorylation. Vehicle, 500 nM synthetic ADDLs, 
1 mg/ml soluble extract from non-AD or 1 mg/ml soluble extract from AD brains were added for 
four hours at 37 °C to cultures of highly differentiated rat hippocampal cells (21 DIV) grown in 
neurobasal medium supplemented with B27. After fixation, cells were immunolabeled overnight 
at 4°C with phosho-Thr231 antibody (1:500) diluted in blocking solution. Neurons were then 
rinsed three times with PBS and incubated with Alexa Fluor 488 anti-rabbit IgG secondary 
antibody (1:1,000) diluted in blocking solution for three hours at room temperature. Cells were 
rinsed five times with PBS, coverslips were mounted with Prolong (Molecular Probes), and cells 
were visualized on a Nikon Eclipse TE 2000-U fluorescence microscope. Images were digitally 
acquired using MetaMorph software (Universal Imaging Corporation). Quantitative analysis of 
the data from multiple experiments, using Image J software (NIH Windows version), indicated 
that both synthetic ADDLs and AD brain-derived ADDLs increased tau hyperphosphorylation 
at Thr231 in hippocampal neurons by approximately three-fold 
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Blocking tau hyperphosphorylation has been achieved with a monoclonal anti- 
body that targets AB oligomers (but not monomers), which also blocks ADDL attach- 
ment to clustered synaptic binding sites. Abnormal tau hyperphosphorylation also 
was blocked by the Src family tyrosine kinase inhibitor, 4-amino-5-(4-chlorophenyl)- 
7(t-butyl) pyrazol (3,4-D) pyramide (PP2), and by the phosphatidylinositol 3-kinase 
inhibitor, LY 294002. Fyn kinase, a member of the Src family, has been linked to tau 
phosphorylation stimulated by insulin (Lesort et al. 1999) and by toxic preparations of 
AP (Zhang et al. 1996). Fyn signaling also reportedly underlies synaptic and cognitive 
impairments in a transgenic model of AD (Chin et al. 2005). Since ADDL-induced 
tau hyperphosphorylation in hippocampal cultures was manifested in the absence of 
neuronal death, these ADDL-stimulated signaling events may be an early event in AD 
neuronal dysfunction and degeneration. 


Oxidative stress and the cellular basis of memantine action 


We recently identified a fourth important facet of AD-relevant pathology induced by 
ADDLs, namely that ADDL attachment to synapses in hippocampal cultures stimulates 
formation of reactive oxygen species (ROS; Fig. 7). Oxidative damage is a significant 
attribute of the AD-affected brain. Because oxidative stress may result from the over- 
activation of NMDA-type glutamate receptors, we tested whether NMDA receptors 
were involved in ADDL-stimulated ROS formation. As shown (Fig. 7), ROS formation 
was completely blocked by memantine, an open-channel blocker of NMDA receptors. 
Inhibition of ADDL-induced ROS generation also was observed with another NMDA 
channel blocker, MK-801 (Fig. 7), and with antibodies against the extracellular domain 
of NR1 NMDA receptor subunits. 

Since ROS at low physiological levels are necessary components of LTP signaling, 
it is plausible that anomalous over-stimulation could impair mechanisms relevant to 
memory formation. There is evidence, moreover, that high ROS levels are damaging in 
age-related impairment of LTP (reviewed in Serrano and Klann 2004). Therefore, the 
fact that ADDLs instigate marked elevations in neuronal ROS may provide an addi- 
tional mechanism by which ADDLs deregulate signaling pathways involved in memory 
formation. This mechanism may be relevant to the therapeutic action of memantine, 
which recently was approved for AD patients. Because NMDA receptor function itself 
is crucial for memory formation, it has been a mystery why a receptor antagonist 
should be beneficial. By showing that memantine contravenes the pathogenic action 
of ADDLs, we have established for the first time a pathologically specific mechanism 
for the therapeutic value of memantine (Namenda). 


ADDLs at synapses - a hypothetical basis for early AD, 
its diagnosis and treatment 


We began with the premise that a unifying theory for AD must explain why its earliest 
stages selectively target memory formation and must also account for the major facets 
of AD neuropathology. Findings reviewed here strongly support the hypothesis that 
specific loss of memory function in early AD is the consequence of targeting and func- 
tional disruption of particular synapses by ADDLs. In the ADDL hypothesis (Fig. 8), 
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Vehicle +Memantine |+MK-801 


Fig. 7. ADDLs induce neuronal reactive oxygen species (ROS) generation. Vehicle, 500 nM AD- 
DLs, 500 nM ADDLs + 10 uM memantine or 500nM ADDLs + 54M MK801 were added for 
three hours at 37°C to cultures of highly differentiated rat hippocampal cells (21 DIV, grown 
in neurobasal medium supplemented with B27). Generation of ROS was monitored using the 
ROS-sensitive fluorescence dye, CM-H2DCFDA (5 uM, 40 minutes of probe loading). Neurons 
were rinsed three times with warm PBS, then two times with neurobasal media without phenol 
red. Cells were placed in neurobasal media without phenol red and then immediately visualized 
on a Nikon Eclipse TE 2000-U fluorescence microscope. Images were digitally acquired using 
MetaMorph software (Meta Image Series, Universal Imaging Corporation). Quantitative analysis 
of DCF fluorescence data obtained from multiple experiments, using Image J software (NIH 
Windows version), established that ADDLs increased ROS generation in hippocampal neurons 
by 3.5-fold and that this increase was fully blocked by memantine and MK801 


oligomerization of AB42 provides gain-of-function pathogenic ligands that bind specif- 
ically to surface membrane proteins localized in synaptic spines. The affected synapses 
are those critically placed in memory circuits, and they are targeted because of local- 
ized ADDL production as well as adventitious ADDL capture by synapticallylocalized 
toxin receptors. As pathogenic agonists, ADDLs disrupt proper regulation of the spine 
cytoskeleton and thereby disrupt spine geometry and receptor trafficking. These spine 
deficiencies impair synaptic information storage. When the net impact of deficiencies 
in spine cell biology passes a critical threshold, the result is failed memory formation. 
Moreover, results that strongly substantiate the ADDL hypothesis have been obtained, 
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Fig. 8. Attack on synapses by ADDLs - a unifying mechanism to account for the specificity of 
AD for memory loss and major facets of neuropathology 


establishing coupling between ADDLs and synapse loss, neuron death, tau hyperphos- 
phorylation and oxidative damage, the major pathological features of AD. 

Given their putative involvement in AD pathogenesis, ADDLs provide significant 
opportunities for diagnostics and therapeutics. The large elevation of ADDLs in AD- 
CSF suggests that diagnostics may some day rely on ADDL-based assays. With respect 
to ADDL-based therapeutics, the most appealing approach is to target the pathogenic 
ADDLs themselves, not the cellular molecules that generate them. Given the soluble 
nature of the target, passive vaccines that specifically neutralize ADDLs are especially 
appealing and may obviate the side-effects and variable responses to active fibril- 
oriented vaccines observed in clinical trials (Ferrer et al. 2004). It also may be feasible 
to discover small molecules that could act as lead compounds for anti-oligomerization 
drugs (Chromy et al. 2003; De Felice et al. 2004; Wang et al. 2004b; Walsh et al. 2005). 
Given that ADDLs at least indirectly affect NMDA receptor signaling pathways and that 
the AD-drug Namenda is an NMDA receptor antagonist (Witt et al. 2004), prospects 
seem likely for new and more effective receptor-directed drugs to emerge from ADDL- 
based screens. Finally, because ADDLs are pathogenic ligands that attack specific 
synapses, the search for specific ADDL receptors as potential drug targets is actively 
underway. 


Note This chapter is adapted from WL Klein, “Synaptic targeting by AB oligomers (ADDLs) as 
a basis for memory loss in early Alzheimer’s disease”. Alzheimer’s and Dementia, 2:60-72, 2006, 
with permission from the Alzheimer’s Association. 
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